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We present a measurement of the tt production cross section using events with one charged lepton 
and jets from pp collisions at a center-of-mass energy of 1.96 TeV. A 6-tagging algorithm based on 
the probability of displaced tracks coming from the event interaction vertex is applied to identify 
b quarks from top decay. Using 318 pb" 1 of data collected with the CDF II detector, we measure 
the tt production cross section in events with at least one restrictive (tight) b-tagged jet and obtain 
8.9i^o(stat.)iJ;J(syst.) pb. The cross section value assumes a top quark mass of m t = 178 GeV/c 2 
in the acceptance corrections. The dependence of the cross section on m t is presented in the 
paper. This result is consistent with other CDF measurements of the tt cross section using different 
samples and analysis techniques, and has similar systematic uncertainties. We have also performed 
consistency checks by using the fa-tagging probability function to vary the signal to background ratio 
and also using events that have at least two 6-tagged jets. 

PACS numbers: 14.65.Ha, 13.85.Ni, 13.85.Qk 



I. Introduction 

The top quark is the most massive fundamental 
particle observed so far, and the study of its proper- 
ties is interesting for several reasons ranging from its 
possible special role in electroweak symmetry break- 
ing to its sensitivity to physics beyond the standard 
model (SM). In particular, the measurement of the 
top quark pair production cross section a t i is of in- 
terest as a test of QCD predictions. Recent QCD 
calculations done with perturbation theory to next- 
to-leading order predict a t i with an uncertainty of 
less than 15% [1, 2], which motivate measurements 
of comparable precision. 

Top quark pairs in the SM are produced via ei- 
ther quark-antiquark annihilation or gluon-gluon fu- 
sion in hadron colliders. At the Fermilab Tevatron 
collider, with a center-of-mass energy of 1.96 TeV 
in pp collisions, about 85% of the total top pair 
production comes from quark-antiquark annihila- 
tion. At this center-of-mass energy, the calculated 
cross section, for the combined Tevatron Run I top 
mass of 178 GcV/c 2 [3], is 6.1±g;f pb [1] and de- 
creases by approximately 0.2 pb for each increase 
of 1 GeV/c 2 in the value of the top mass over the 
range 170 GeV/c 2 < m t < 190 GcV/c 2 . The stan- 
dard model top quark decays to a W boson and a b 
quark almost 100% of the time, resulting in a final 
state from tt production of two W bosons and two b 
jets from b quark fragmentation. When one W decays 
leptonically and the other W decays to quarks, the 
tt event typically contains a high momentum charged 
lepton, an undetected neutrino and four high trans- 
verse momentum jets, two of which originate from b 
quarks. The undetected neutrino results in an im- 
balance of the transverse energy of the event, labeled 
as "missing Et v {$t)- This decay mode is called 
"lepton+jets". 

In this paper, we report a measurement of the cross 
section for pair production of top quarks in the lep- 
ton+jets channel in 318 pb -1 of pp collision data at 



y/s — 1.96 TeV. The data were recorded between 
March 2002 and August 2004, during Run II of the 
Tevatron, by the CDF II detector, a general purpose 
detector which combines charged particle trackers, 
sampling calorimeters, and muon detectors. Pro- 
cesses in which a W boson is produced in associa- 
tion with several jets with large transverse momen- 
tum can be misidentihed as tt, since they have the 
same signature. In order to separate the tt events 
from this background, we develop a method to tag 
6-jcts based on tracking information from the silicon 
detector. The main event selection requires at least 
one tight (more restrictive) b tag in the event. As 
a cross check, we also measure the cross section u- 
sing events with a loose (less restrictive) b tag and 
events which have at least two tight or at least two 
loose b tags. Background contributions from heavy 
flavor production processes, such as Wbb, Wcc or Wc, 
misidentihed W bosons, electroweak processes, single 
top production, and mistagged jets are estimated u- 
sing a combination of Monte Carlo calculations and 
independent measurements in control data samples. 
An excess over background in the number of events 
that contain a lepton, missing energy and three or 
more jets with at least one &-tag is assumed to be 
a signal of tt production and is used to measure the 
production cross section <j t i- 

Previous measurements [4] at ^/s = 1.8 TeV gave 
a production cross section consistent with the stan- 
dard model prediction. Recent CDF measurements 
at y/s = 1.96 TeV are reported in Refs. [5-9] and 
use different techniques and top decay channels. The 
measurement described here analyzes more data than 
the above, and uses a jet probability 6-tagging algo- 
rithm. A feature of this algorithm is that 6-tagging 
is based on a continuous probability function rather 
than on a discrete object such as a secondary vertex. 
Potentially, this tagger can also be used to statisti- 
cally separate b and c heavy flavor contributions. 

The organization of this paper is as follows. Sec- 
tion II reviews the detector systems relevant to this 
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analysis. In Section III, we describe the data sample 
and event reconstruction. The ^-tagging algorithm 
and its efficiency and misidcntification ("fake") rate 
are discussed in Section IV. Section V describes the 
event selection. The estimate of the different back- 
grounds is presented in Section VI. The tt event ac- 
ceptance and tagging efficiency are derived in Sec- 
tion VII. The tt production cross section measure- 
ments in single and double tagged events are reported 
in Sections VIII, and IX, respectively. Finally, the 
conclusions are presented in Section X. 



II. THE CDF II DETECTOR 

The CDF II detector uses a cylindrical coordinate 
system with the z coordinate along the proton direc- 
tion, the azimuthal angle <p, and the polar angle 9 
usually expressed in terms of the pseudo-rapidity r\ 
= -ln[tan(#/2)]. The rectangular coordinates x and 
y point radially outward and vertically upward from 
the Tevatron ring, respectively The detector has 
been described in detail elsewhere [10]. In this sec- 
tion, we give a brief description of the parts relevant 
for the analysis. 

Tracking systems are essential to trigger on and 
identify high momentum charged particles such as 
electrons and muons. The charged particle tracking 
detectors are contained in a superconducting solenoid 
which generates a magnetic field of 1.4 T, oriented 
parallel to the proton beam direction. The Central 
Outer Tracker (COT) [11] is a 3.1 m long open cell 
drift chamber which performs up to 96 track posi- 
tion measurements in the region between 0.40 m and 
1.37 m from the beam axis. Sense wires are arranged 
in 8 alternating axial and ± 2° stereo superlayers 
with 12 wires each. The position resolution of a sin- 
gle drift time measurement is approximately 140 /im. 
For high momentum tracks, the COT transverse mo- 
mentum resolution is u VT jp\ w 0.0017 GeV/c. 

Inside the inner radius of the COT, a five layer 
doubled-sided silicon microstrip detector (SVX) [12] 
covers the region between 2.5 cm to 11 cm from the 
beam axis. Three separate SVX barrel modules along 
the beamline cover a length of 96 cm, approximately 
90% of the luminous beam intersection region. Three 
of the five layers combine an r-<j> measurement on one 
side and a 90° stereo measurement on the other, and 
the remaining two layers combine r-<f> with a small 
stereo angle of ±1.2°. Silicon microstrips have a pitch 
of 60 to 65 /jm depending on the layer. Three addi- 
tional Intermediate Silicon Layers (ISL) [13] at radii 
between 19 cm and 30 cm in the central region link 
tracks in the COT to hits in SVX. 



Electromagnetic (EM) and hadronic (HAD) sam- 
pling calorimeters [14-16] surround the tracking sys- 
tem and measure the energy flow of interacting parti- 
cles in the pseudo-rapidity range \rj\ < 3.64. The EM 
and HAD calorimeters are lead-scintillator and iron- 
scintillator sampling devices, respectively. They are 
segmented into projective towers, each one covering 
a small range in pseudo-rapidity and azimuth. Most 
towers cover 15 degrees in <f> and 0.10 to 0.13 units in 
pseudo-rapidity. Proportional chambers (CES) mea- 
sure the transverse profile of EM showers at a depth 
corresponding to the shower maximum for electrons. 
Electrons are reconstructed in the central electromag- 
netic calorimeter (CEM) with a transverse energy 
precision cr(E T )/E T = 13.5%/ y/E T /GeV 2% [14]. 
Jets are identified as a group of electromagnetic and 
hadronic calorimeter towers with an energy resolu- 
tion of approximately 0.1- E T + 1-0 GeV [17]. 

The muon system is located outside of the 
calorimeters. Four layers of planar drift chambers 
(CMU) [18] detect muons with p T > 1.4 GeV/c that 
penetrate the five absorption lengths of calorimeter 
steel in the central region of \rj\ < 0.6. An additional 
four layers of planar drift chambers (CMP) [19] lo- 
cated behind 0.6 m of steel outside the magnet re- 
turn yoke detect muons with px > 2.0 GeV/c. When 
a track is linked to both CMU and CMP, it is called 
a CMUP muon. The Central Muon Extension detec- 
tor (CMX), arranged in a conical geometry, provides 
muon detection in the region 0.6 < \rj\ < 1.0 with 
four to eight layers of drift chambers, depending on 
the polar angle. All the muon chambers measure the 
azimuthal coordinates of hits via a drift time mea- 
surement. The CMU and CMX also measure the 
longitudinal coordinate, z. 

The beam luminosity is determined by using gas 
Cherenkov counters [20] located in the region 3.7 < 
\r)\ < 4.7 which measure the average number of in- 
elastic pp collisions per bunch crossing. The total 
uncertainty on the luminosity is 5.9%, where 4.4% 
comes from the acceptance and operation of the lu- 
minosity monitor and 4.0% from the calculation of 
the inelastic pp cross section [21]. 



III. DATA SAMPLE AND EVENT 
RECONSTRUCTION 

The data used in this analysis are from pp collisions 
at a center-of-mass energy of ^fs =1.96 TeV recorded 
by the CDF II detector between March 2002 and 
September 2004. The data sample has been collected 
by triggers based on the selection of a high transverse 
momentum lepton (electron or muon). The total in- 
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tegrated luminosity is 318 pb^ 1 for CEM electron 
and CMUP muon candidates, and 305 pb _1 for CMX 
muon candidates. Briefly, we discuss the trigger and 
lepton identification requirements, the reconstruction 
of jets, and the missing transverse energy, $ T . 

CDF has a three-level trigger system to filter events 
from a 2.5 MHz beam crossing rate down to 60 Hz 
for permanent storage. The first two levels of triggers 
are special purpose hardware and the third consists 
of a farm of computers. 

The first trigger level (LI) reconstructs charged 
particle tracks in the COT r-cfi projection using a 
hardware track processor called the Extremely Fast 
Tracker (XFT) [22]. The LI electron trigger requires 
a XFT track with p T > 8 GeV/c matched to an EM 
calorimeter tower with Et > 8 GeV and with a ra- 
tio of hadronic-to-electromagnetic energy less than 
0.125. The LI muon trigger requires an XFT track 
with pt > 4 GeV/c matched to a muon track seg- 
ment with px > 6 GeV/c from the CMU and CMP 
chambers or a track with pt > 8 GeV/c matched 
to a muon track segment with pt > 6 GeV/c in the 
CMX chambers. 

The second level (L2) electron trigger requires the 
XFT track found at LI to be matched to a cluster 
of energy in the central EM calorimeter with Et > 
16 GeV. The cluster adds the energy of the neigh- 
boring trigger towers with Et > 7.5 GeV to the 
original LI trigger tower. A trigger tower consists of 
two calorimeter towers. The L2 muon trigger accepts 
events passing LI. 

The third trigger level (L3) is a farm of Linux com- 
puters which perform on-line event reconstruction, 
including 3D charged particle reconstruction. The L3 
electron trigger requires a track with pt > 9 GeV/c 
matched to an energy cluster of three adjacent tow- 
ers in pseudo-rapidity in the central EM calorimeter 
with Et > 18 GeV, consistent with the shower pro- 
file expected from test beam electrons. The L3 muon 
trigger requires a track with pt > 18 GeV/c matched 
to a track segment in the muon chambers within 
10 cm in the r — <fi view and, for CMU and CMX 
muons only, within 20 cm in the z view. The effi- 
ciency of these triggers is measured using — > e^v 
and Z — > fi + ' pT data (the method is described in 
Ref. [23]) and is found to be (96.2 ± 0.6)% for CEM 
electrons, and (90.8 ± 0.5)% and (96.5 ± 0.4)% for 
CMUP and CMX muons respectively, for electrons 
and muons passing through the fiducial volume of 
these detectors. 



A. Track and Primary Vertex Reconstruction 

The trajectories of charged particles are found (in 
a first approximation) as a series of segments in the 
axial superlayers of the COT. Two complementary al- 
gorithms associate the segments lying on a common 
circle to define an axial track. Segments in the stereo 
layers are associated with the axial tracks to recon- 
struct 3D tracks. For muons and electrons used in 
this analysis, COT tracks are required to have at 
least 3 axial and 2 stereo segments with at least 5 
hits per superlayer. The efficiency for finding iso- 
lated high momentum COT tracks in the COT fidu- 
cial volume with pt > 10 GeV/c is measured using 
electrons from — > e ± f events and is found to be 
(98.3 ± 0.1)%. Silicon hit information is added to 
reconstructed COT tracks using an "outside-in" tra- 
cking algorithm. The COT tracks are extrapolated 
to the silicon detector and the track is refit using 
the information from the silicon measurements. The 
initial track parameters provide a width for a search 
region in a given layer. For each candidate hit in that 
layer, the track is refit and used to define the search 
region into the next layer. The search uses the two 
best candidate hits in each layer to generate a small 
tree of final track candidates, and the one with the 
best fit x 2 is selected. The efficiency to associate at 
least three silicon hits with an isolated COT track is 
found to be (91 ± 1)%. 

The primary vertex location for a given event is 
found by fitting well-measured tracks to a common 
point of origin. At high luminosities, more than one 
collision can occur on a given bunch crossing. For a 
luminosity of ^10 32 cm _2 s _1 , there are ~2.3 inter- 
actions per bunch crossing. The luminous region is 
long, with (j z = 29 cm; therefore the primary ver- 
tices of each collision are typically separate in z. The 
first estimate of the primary vertices (xy, yv, zy) 
is binned in the z coordinate, and the z position of 
each vertex is then calculated from the weighted av- 
erage of the z coordinate of all tracks within 1 cm 
of the first iteration vertex, with a typical resolution 
of 100 fim. The primary vertex is determined event 
by event by an iterative algorithm which uses tracks 
around a seed vertex, defined as above, to form a new 
vertex. The \ 2 f°r an tracks relative to the new ver- 
tex is calculated, tracks with bad x 2 are removed, and 
the cycle is repeated until all tracks have a good x 2 . 
The locus of all primary vertices defines the beamline, 
the position of the luminous region of the beam-beam 
collisions through the detector. A linear fit to (xy, 
yy) vs. zy yields the beamline for each stable ru- 
nning period. The beamline is used as a constraint to 
refine the knowledge of the primary vertex in a given 
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event. The transverse beam cross section is circular, 
with a rms width of w 30 /zm at z = 0, rising to w 50 
- 60 ^im at \z\ =40 cm. The beam is not necessarily 
parallel nor centered in the detector. 



B. Electron Identification 

Electron reconstruction begins with a track with 
Pt > 9 GeV/c that extrapolates to a cluster of three 
CEM towers adjacent in pseudo-rapidity with a total 
Et > 20 GcV. Several cuts are successively applied 
in order to improve the purity of the electron selec- 
tion, as summarized in Table I. Electron candidates 
passing these requirements are called tight electrons. 

The ratios between the hadronic and the electro- 
magnetic cluster energies Ehad / Eem and between 
the cluster energy and the track momentum E/p are 
required to be consistent with an electron's energy 
deposition in the calorimeters. The cluster is further 
required to be isolated, the isolation I being defined 
as the ratio of the add itional transver se energy in a 
cone of radius R = \J (A<j)) 2 + (Ar/) 2 = 0.4 around 
the cluster to the transverse energy of the cluster it- 
self. 

The position of the electromagnetic shower mea- 
sured by the CES detector is used to define matching 
requirements between the extrapolated track and the 
cluster in the CES x and z local coordinates. In par- 
ticular, a charge dependent cut in the x position is 
applied to take into account the different flow of en- 
ergy deposited by bremsstrahlung photons emitted 
by an electron or a positron. In addition, the CES 
provides electron identification through the observed 
shower shape. The CES shower shape is fitted in 
the z view to the distribution expected for an elec- 
tron, and the chisquare probability for the fit, xltripi 
is used as a cut on the shower profile. Finally, the 
sharing of energy between adjacent calorimeter tow- 
ers is quantified by the lateral shower profile L s h r , 
which measures how close the energy distribution in 
the CEM towers adjacent to the cluster seed is to the 
electron hypothesis. 

Electrons from photon conversions throughout the 
detector material are vetoed by rejecting electron 
candidates if an oppositely charged track with a small 
distance of closest approach (D) is found. This anal- 
ysis is sensitive to any loss in efficiency from the 
misidentification of an electron from the W boson 
decay as a photon conversion. Therefore, in order 
to avoid loss of efficiency, the veto is not applied to 
events consistent with electrons radiating a photon 
that subsequently converts. The performance of this 
algorithm to identify electrons from photon conver- 



sions is estimated to be (72.6 ± 0.1)% [9], where the 
uncertainty covers both statistical and systematic. 

The efficiency of the electron selection on ti events 
is determined by means of Monte Carlo simulation. 
Studies of Z — > e + e~ processes show that a data to 
Monte Carlo simulation scale factor of (99.6 ^05)% 
is needed to correct the simulation predictions for the 
efficiency for CEM electron identification. 

Other electron categories are defined. Candidate 
electrons passing all the above requirements except 
for the isolation cut are called loose electrons. Tracks 
matched to an energy deposit in the plug calorimeter 
(1.2 < < 2.0) are called plug electrons. 



C. Muon Identification 

Muon identification starts by requiring an isolated, 
high momentum COT track that extrapolates to a 
track segment in the muon chambers. Several addi- 
tional requirements are imposed in order to minimize 
contamination from hadrons punching through the 
calorimeter, decays in flight of charged hadrons and 
cosmic rays. Table II lists the selection requirements 
for candidate muons. Muon candidates passing these 
cuts are called tight muons. 

The COT track must have p T > 20 GeV/c, and at 
least 3 axial and 2 stereo segments with a minimum 
of 5 hits per segment. The distance of closest appro- 
ach of the track to the beamline in the transverse 
plane, do, must be small in order to select prompt 
muons (coming from the interaction primary vertex) 
and reject cosmics and in-flight decays. The energy 
deposition in the EM and HAD calorimeters, -Eem 
and -Ehad, must be small as expected for the pa- 
ssage of a minimum ionizing particle. The distance 
between the extrapolated COT track and the track 
segment in the muon chambers, Ax, must be small in 
order to ensure a good match. If a track is matched 
to a CMU segment, a matching CMP segment is also 
required, and vice versa. Isolation is defined as the 
ratio between any additional transverse energy in a 
cone of radius R = 0.4 around the track direction 
and the muon pr, and it is required to be smaller 
than 0.1. Cosmic rays are efficiently identified and 
rejected through their asynchronous track timing re- 
lative to the beam crossing time and their incoming 
and outgoing back-to-back track topology. 

Studies of Z — ► fi + \i~ processes show that a data to 
Monte Carlo simulation scale factor of (87.4 ± 0.9)% 
((98.9 ± 0.6)%) is needed to correct the simulation 
predictions for the CMUP (CMX) muon identifica- 
tion efficiency. 

As for the electrons, candidate muons passing all 
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TABLE I: Selection requirements for tight electrons. 



Electron Variable Cut 

~p? > 10 GeV/c 

E T > 20 GeV 

COT Axial Segments > 3 

COT Stereo Segments > 2 

Hits for Each COT Segment > 5 

E H ad /Eem < 0.055 + 0.00045 x E 

E/p < 2.0 unless p T > 50 GeV/c 

Isolation < 0.1 

L s hr < 0.2 

CES \Az\ < 3.0 cm 

CES Q x |Ax| -3.0 < Q x \Ax\ < 1.5 cm 

CES Xlrip < 10 

Photon Conversions Veto if D < 0.2 cm and Acot(g) < 0.04 



TABLE II: Selection requirements for tight muons. 



Muon Variable Cut 

'vt > 20 GeV/c 

COT Axial Segments > 3 

COT Stereo Segments > 2 

Hits for Each COT Segment > 5 

\do\ < 0.2 cm if no silicon hits 

\do\ < 0.02 cm if silicon hits 
E H ad < max(6, 6 + 0.0280(p - 100)) GeV 
Eem < max(2, 2 + 0.0115(p — 100)) GeV 

CMU \Ax\ < 3.0 cm 

CMP |Ax| < 5.0 cm 

CMX | Ax | < 6.0 cm 

Isolation < 0.1 

Cosmic Rays Veto 



the cuts except the isolation cut are called loose 
muons. A track matched to a CMU or a CMP seg- 
ment only, which passes all the other cuts including 
isolation, is also accepted as a loose muon. 



D. Jet Reconstruction and Corrections 

The jets used in this analysis are reconstructed 
from calorimeter towers using a cone algorithm [24] 
with a radius R < 0.4, for which the E T of each 
tower is calculated with respect to the z coordinate 
of the event. The calorimeter towers belonging to any 
electron candidate are not used by the jet clustering 
algorithm. The energy of the jets is corrected [25] 



for the pseudo-rapidity dependence of the calorime- 
ter response, the calorimeter time dependence, and 
extra Et from any multiple interactions. 

By definition, tight jets have corrected Et > 
15 GeV and detector |fy| < 2.0, whereas loose jets 
have corrected Et > 8 GeV and detector |f?| < 2.0. 
Detector 77 is the pseudo-rapidity of the jet calculated 
with respect to the center of the detector. 



E. Missing Transverse Energy Reconstruction 

The presence of neutrinos in an event is inferred 
by an imbalance of transverse energy in the detector. 
The missing transverse energy, is defined as the 
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magnitude of — ^2 i [ET,iCos(fa), ET,iSin(fa)], where 
Ej-^i is the transverse energy of the calorimeter tower 
i calculated with respect to the z coordinate of the 
event, fa is its azimuthal angle, and the sum is over all 
calorimeter towers. The $ T is corrected by subtract- 
ing the transverse momentum of the muon track and 
adding back the transverse energy in the calorimeter 
towers traversed by the muon. Because the $ T cal- 
culation uses all calorimeter towers, the If T vector 
is adjusted for the effect of the jet corrections for all 
jets with E T > 8 GeV and < 2.5. 



F. Monte Carlo Samples and Detector 
Simulation 

The understanding of acceptances, efficiencies and 
backgrounds relics on detailed simulation of physics 
processes and detector response. 

The detector acceptance for tt events is modeled 
using PYTHIA v6.2 [26] and HERWIG v6.4 [27]. This 
analysis uses the former for the final cross section es- 
timate and the latter to estimate the systematics due 
to differences in the modeling of ti production and 
decay. These generators employ leading order ma- 
trix elements for the hard parton scattering, followed 
by parton showering to simulate gluon radiation and 
fragmentation effects. The generators are used with 
the CTEQ5L parton distribution functions [28]. Decays 
of b and c hadrons are modeled using QQ v9.1 [29]. 
Estimates of backgrounds from W bosons produced 
in association with jets are derived using the ALPGEN 
generator [30] with parton showering provided by 
HERWIG. The background from electroweak processes 
and single top production is studied using PYTHIA. 

The CDF II detector simulation reproduces the 
response of the detector and uses the same detec- 
tor geometry database as the event reconstruction. 
Particle interactions through matter are performed 
with GEANT3 [31]. Charge deposition in the silicon 
detectors is calculated using a simple geometrical 
model based on the path length of the ionizing par- 
ticle and an unrestricted Landau distribution. The 
drift model for the COT uses a parametrization of a 
GARFIELD [32] simulation with parameters tuned to 
match COT collider data [11]. The calorimeter simu- 
lation uses the GFLASH [33] parametrization package 
interfaced with GEANT3. The GFLASH parameters are 
tuned to test beam data for electrons and high-p^ pi- 
ons and they are checked by comparing the calorime- 
ter energy of isolated tracks in collision data to their 
momentum as measured in the COT. More details on 
the CDF II simulation can be found in Ref. [34]. 



IV. JET PROBABILITY 6-TAGGING 
ALGORITHM 

The jet probability 6-tagging algorithm [35] is used 
to determine whether a jet has been produced from 
the hadronization process of a light parton or a heavy 
parton. The latter result in long-lived hadrons whose 
decay gives rise to tracks displaced from the primary 
interaction vertex. This algorithm uses tracks asso- 
ciated with a jet to determine the probability for 
these to come from the primary vertex of the inter- 
action. The calculation of the probability is based 
on the impact parameters (do) of the tracks in the 
jet and their uncertainties. The impact parameter is 
assigned a positive or negative sign depending on the 
position of the track's point of closest approach to 
the primary vertex with respect to the jet direction, 
as shown in Fig. 1. By construction, the probability 
for tracks originating from the primary vertex is uni- 
formly distributed from to 1. For a jet coming from 
heavy flavor hadronization, the distribution peaks at 
0, due to tracks from long lived particles that have a 
large impact parameter with respect to the primary 
vertex. 




Track 2: dj is negatively signed 

FIG. 1: The sign of the impact parameter of a track. 
The impact parameter is positive (negative) if the angle <f> 
between the jet axis and the line connecting the primary 
vertex and the track's point of closest approach to the 
primary vertex itself is smaller (bigger) than ir/2. 

The particles in a jet coming from a light parton 
originate at the primary vertex, but these tracks are 
reconstructed with a non-zero impact parameter due 
to the finite tracking resolution. They have an equal 
probability of being positively or negatively signed. 
Jets which originate from a heavy parton contain long 
lived hadrons giving rise to tracks displaced in the jet 
direction, which preferentially populate the positive 
side of the signed impact parameter distribution. The 
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width of the negative impact parameter distribution 
is solely due to the tracking detector resolution, beam 
spot size, and multiple scattering. 

The tracking resolution can be extracted from the 
data by fitting the negative side of the signed impact 
parameter distribution of tracks from prompt jets, 
which are the dominant component of inclusive jet 
data. Tracks are divided into 72 different categories 
according to the number and quality of SVX hits, de- 
tector r\ and px- To minimize the contribution from 
badly measured tracks with a large reconstructed im- 
pact parameter, the signed impact parameter signif- 
icance, Sd (ratio of the impact parameter to its un- 
certainty), is parameterized for each track category. 
Tracks are fitted to a helix, and the impact para- 
meter is corrected for beam offsets in order to take 
into account any displacement of the primary vertex 
from the nominal position. The uncertainty in the 
impact parameter is given by the error propagation 
of the uncertainties in the fit and in the beam offset 
correction. We parameterize the impact parameter 
significance for tracks satisfying the quality criteria 
listed in Table III that are associated with jets with 
Et > 7 GeV and \rj\ < 2.5. These tracks must have 
Pt > 0.5 GeV/c, impact parameter less than 0.1 cm 
(in order to reject long lived K's and A's), three to 
five hits on different axial layers of the SVX, at least 
20 (17) hits in the COT axial (stereo) layers, and the 
z position of the track must be within 5 cm of the 
event primary vertex. Tracks passing this selection 
are called jet probability tracks. The |do| is measured 
with respect to the primary vertex. The event is re- 
quired to have a primary vertex, and the vertex with 
highest sum of transverse momentum of all tracks is 
chosen in events which have more than one vertex. 

TABLE III: Selection criteria for tracks used by the jet 
probability algorithm. 



Variable 



Cut 



PT 
\do\ 

N SVX axial 
^COT axial 
^COT stereo 

\%trk Zpv I 



> 0.5 GeV/c 
< 0.1 cm 

> 3 and < 5 

> 20 

> 17 
< 5 cm 




FIG. 2: Distribution of the impact parameter significance 
for tracks in an inclusive jet sample with at least 5 good 
SVX hits, pt > 5 GeV/c, and \rj\ < 0.6. 



impact parameter significance of a given track is due 
to the detector resolution, defined as: 



Ptr(S d0 ) 



j: l ^ dol R(s)ds 

J°_ 00 R(S)dS ' 



(1) 



The Sd distribution peaks at zero and falls quickly 
with increasing absolute value of | Sd | , but the tails 
are rather long. In order to improve the statistics and 
obtain a better fit in the tail, we use non-linear bins 
by transforming it to X — ln(\Sj |), where the minus 
sign indicates that only the negative part of Sd is 
used. Figure 3 shows the result of such a fit, together 
with the fit residues defined as (data-fit)/uncertainty, 
where the uncertainty is taken as the statistical un- 
certainty on each data point. A resolution function 
parameterized as the convolution of four Gaussians 
with means at zero is found to fit well all distribu- 
tions for all 72 track categories: 



R(S) = J2 



Pi -S 2 /2af 



2lT<Ji 



(2) 



After the transformation to a logarithmic axis, the 
resolution function becomes 



Figure 2 shows the distribution of the impact pa- 
rameter significance of tracks in an inclusive jet sam- 
ple for one of the track categories, namely tracks with 
at least 5 good SVX hits, p T > 5 GeV/c and \rj\ < 0.6. 
The negative side of this distribution is fitted with a 
function R(S) called the resolution function, which is 
used to determine the probability, Ptr {Sd ) , that the 



R(x) = 



P * e (X -^f ] (3) 



i=i 



and R(X) is used to fit the transformed X = ln(\S do |) 
distribution. 

The jet probability Pj that a jet is consistent with 
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U) 4500 




ln(|(S do )|) 

FIG. 3: Fit to the transformed impact parameter signif- 
icance, lndiS^ |), where only the negative side of the Sd 
distribution is used. The resolution function is chosen as 
the convolution of four Gaussians. The inset shows the 
residues of the fit, (data- fit) /uncertainty. 



a zero lifetime hypothesis is defined as 

fe=0 

where 

N trk 

n = n p *r (5) 

i=i 

and N tr k is the number of jet probability tracks with 
positive impact parameter. Jets arc required to have 
at least two jet probability tracks with positive im- 
pact parameter to be taggable. Both of these distri- 
butions should be uniformly distributed in the inter- 
val [0-1] for jets having only prompt tracks. Tracks 
with negative impact parameter are used to define a 
negative Pj, which is used to check the algorithm and 
to estimate the misidentification rate. We define posi- 
tive (negative) tagged jets as those jets whose positive 
(negative) Pj is less than a cutoff, where we use 1% 
(main result) and 5% (cross check). Positive tagged 
jets are expected to be enriched in heavy flavor. The 
1% cut was used in previous publications [35] and 
has similar performance to the secondary vertex tag- 
ger [7], while the loose (less restrictive) 5% cut was 
chosen near the point where the Pj distribution be- 
comes flat (see Fig. 12). Further gain in ti selection 
efficiency resulting from a looser Pj cut is accom- 
panied by an increase in background from light jets 
misidentified as heavy flavor (mistags). For compar- 



ison, both the 1% and 5% numbers and figures are 
presented together throughout the paper. 

A feature of this algorithm is that the 6-tagging 
is performed using a continuous variable instead of a 
discrete object like a reconstructed secondary vertex. 
It therefore provides a variable that allows one to 
move continuously along the efficiency curve and to 
select the optimal signal to background point for a 
specific analysis. Furthermore, the ability to adjust 
the Pj cut is a valuable tool to understand the heavy 
flavor content of the sample. Potentially [35], this 
method can be used to statistically separate b and c 
heavy flavor contributions. This feature is illustrated 
in the left plot in Fig. 4, where the jet probability 
distributions for 6, c and light jets are shown. Monte 
Carlo simulated 2 — > 2 parton events are used as des- 
cribed in Section IV A. In the right plot, we show the 
jet probability distributions observed in two different 
data sets of jets. The first sample is enriched in heavy 
flavor content by requiring the jets to contain a soft 
momentum electron; here, events are triggered on low 
Pt inclusive electrons (see Section IV A) . The second 
set consists of generic QCD jets selected by requiring 
events with at least one jet with Et > 50 GeV (the 
Jet50 sample). 

In this section we discuss the 6-tagging algorithm 
itself, independently of the other details of this ana- 
lysis. 



A. Measurement of the Tagging Efficiency for 
Heavy Quark Jets 

The method used to measure the jet probability 
tagging efficiency for heavy flavor jets is described 
in detail in Ref. [7]. The ideal events to study this 
efficiency are bb dijet events. We use a calibration 
data sample of jets whose heavy flavor fraction can 
be measured: a sample triggered on low pt inclusive 
electrons which is enriched in semileptonic decays of 
bottom and charm hadrons. The tagging efficiency 
is also measured for simulated jets by using a Monte 
Carlo sample similar to the calibration sample. We 
use HERWIG to generate 2-^2 parton events, which 
are passed through a filter requiring an electron with 
Pt > 7 GeV/c and |?7| < 1.3. Events passing this 
filter are processed using the detector simulation des- 
cribed in Section IIIF. Electrons are identified using 
a selection similar to that described in Section IIIB, 
except that they are required to be non-isolated and 
have a lower energy threshold (Et > 9 GeV and track 
Pt > 8 GeV/c). The heavy flavor content of the sam- 
ple is further enhanced by requiring two jets in the 
event, an "electron jet" , presumed to contain the de- 
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FIG. 4: Left: jet probability distributions for jets matched to b (full circles), c (empty circles) and light (empty squares) 
quarks in Monte Carlo simulated events. Right: jet probability distributions for electron jets in inclusive electron data 
(full circles) and for generic QCD jets in Jet50 data (empty squares). 



cay products of a heavy flavor hadron, and an "away 
jet" . The electron jet must have Et > 15 GeV and 
be within 0.4 of the electron direction in rj-(f) space. 
The away jet is required to have Et > 15 GeV and 
\rj\ < 1.5, and it must be approximately back-to-back 
with the electron jet (A<fi e -j > 2 rad). If the fraction 
of electron jets containing heavy flavor for which the 
away jet is tagged (F HF ) is known, and if there were 
no prompt jets misidcntificd as 6-jets, the efficiency 
to tag a heavy flavor jet containing an electron would 
be given by 



N e t 



1 



rpa 
HF 



where N^X is the number of events for which both the 
electron jet and the away jet are positively tagged, 
and N a+ is the total number of events for which the 
away jet is positively tagged. Since light jets can be 
tagged as well, we correct for this effect by subtrac- 
ting the number of negative tags. We define the posi- 
tive tag excess for events with a positive or negative 
tag in the away jet as 



A„._ 



N e 2 



1 * a 



(7) 

(8) 



is the number of events 



where, for example 
where the electron jet is negatively tagged and the 
away jet is positively tagged. The tagging efficiency 
for heavy flavor jets containing an electron is then 



given by 



Aq+ ~ A„- 

(iV a+ - N a „) F- 



(9) 



HF 



Since events with an electron jet and a tagged away 
jet are mostly due to heavy flavor pair production, 
one expects F^ F to be close to unity. This number 
is less than 1.0 due to events in which the away jet 
is mistagged or contains heavy flavor due to gluon 
splitting or flavor excitation, and the electron is ei- 
ther a jet misidcntificd as electron or part of a photon 
conversion pair. If P denotes the probability to posi- 
tively tag the away jet in an event where the electron 
jet is a light jet, then F HF is given by 



F£ F = 1- P(l - F HF ), 



(10) 



where Fhf = Fb + F c = Fb • (1 + F c / b ) denotes the 
total heavy flavor fraction of electron jets. Here 
and F c are the total b and c fractions of electron jets, 
respectively, and F c /i, is the c to b fraction ratio. We 
estimate P using identified conversions as 



p = 



N a +-N a 



N 



(11) 



where N is the number of events passing the selec- 
x and the c subscript refers to 



tion, e„ 



N, 



-N„ 



events where the electron was identified as a con- 
version. A full derivation of this expression can be 
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found in Ref. [7]. Two methods are used to mea- 
sure the b- fraction, F^, of the electron jets. The first 
method is to reconstruct D° — > K~ir + decays within 
the electron jet and use the invariant mass side- 
bands to subtract background. The second method 
involves searching for secondary muons within the 
electron jet resulting from cascade b — > c — > Ivq de- 
cays using the same-sign rate to estimate the back- 
ground. The contribution from charm, F c ^, is de- 
termined from Monte Carlo simulation to be F c / b = 
0.61 ± 0.10. For inclusive electron data we measure 
F HF = 0.259 ± 0.064 and F% F = 0.71 ± 0.05. 

The efficiencies to tag a taggable heavy flavor jet 
with Et > 15 GeV in data are summarized in Ta- 
ble IV for Pj < 1% and 5%. The ratio of data effi- 
ciency to Monte Carlo simulation efficiency is called 
the tagging scale factor (SF). The uncertainties 
shown are statistical and systematic, which are des- 
cribed below. 



TABLE IV: Efficiency to tag a taggable heavy flavor elec- 
tron jet in data and the tagging scale factor (SF) for jet 
probability cuts of 1% and 5%. 





Pj < 1% 


Pj < 5% 


e (Data) 


0.258 ± 0.011 


0.334 ± 0.016 


SF 


0.817 ± 0.07 


0.852 ± 0.072 



It is crucial to understand the tagging efficiency 
and scale factor dependences on the jet Et in or- 
der to characterize the jet probability algorithm per- 
formance. The Et dependence observed in the in- 
clusive electron sample is shown in Fig. 5 and 6 for 
the tagging efficiency and scale factor respectively. 
Due to the lack of statistics at high jet Et, we re- 
peat the study using two samples of high energy 
jets selected by requiring events with at least one 
jet with E T > 20 GeV (the Jet20 sample) or with 
E T > 50 GeV (the Jet50 sample) . The absolute value 
of the SF can not be extracted because of the un- 
known content of heavy flavor in these samples. How- 
ever, since the variations of heavy flavor fraction are 
relatively small over a large range of Et, we can still 
estimate the Et dependence of the scale factor from 
the Et dependence of the ratio of positive tag excess 
between data and Monte Carlo simulation. Table V 
shows the results of a linear fit of the tagging scale 
factor to the jet Et in the inclusive electron, Jet20 
and Jet50 samples. The combined estimate of the 
slopes is found to be consistent with a flat Et depen- 
dence of the scale factor both when a Pj cut of 1% 
and 5% is applied. Based on these results, we con- 
clude that the scale factor measured in the inclusive 
electron sample is valid at any Et- 



Different sources of systematic uncertainty in the 
determination of SF have been considered. An un- 
certainty on the value of Ft,, determined from the 
rate of D° — > Kit decays, comes from the branching 
ratio BR(B -» luD°X). A factor 1.131 ± 0.070 is 
used to normalize the Monte Carlo simulation pre- 
diction to the PDG [36] value. The uncertainty in- 
cludes both the PDG branching ratio uncertainty and 
the Monte Carlo simulation statistical error. Another 
uncertainty on Ft, comes from the difference in D° re- 
construction efficiency, ejjo, between data and Monte 
Carlo simulation. This uncertainty is derived by 
studying the efficiency of reconstruction for simulated 
D° — > Kit decays embedded into data events, and is 
found to be 10%. There is an additional uncertainty 
due to the assumption of symmetry between negative 
tags and positive mistags implicit in the derivation of 
Equation 9. The effect of a mistag asymmetry is esti- 
mated by scaling the subtracted negative tags by di- 
fferent factors ranging from to 2 (0.4 to 1.4) for 1% 
(5%) jet probability cuts, and an uncertainty of 7% 
is conservatively derived on the tagging scale factor 
due to this effect. Final estimates for jet probability 
tagging efficiencies and scale factors are summarized 
in Table IV. 

We do not measure the tagging scale factor for c 
jets. We assume a common scale factor for jets from 
b and c quarks and we increase the uncertainty for 
a c quark scale factor by 100% to take into account 
additional uncertainties due to this assumption. 



B. Measurement of the Mistag Rate 

An important ingredient of any analysis which uses 
heavy flavor tagging is the background from light 
quark or gluon jets incorrectly tagged as heavy flavor. 
The probability of (positively) tagging a light jet (the 
"mistag rate") is closely related to the negative tag 
rate. We remind the reader that a positive (negative) 
Pj is calculated using positively (negatively) signed 
impact parameter tracks, and a jet which has positive 
(negative) Pj smaller than a certain cut is said to be 
positively (negatively) tagged. It is assumed that the 
negative tags are due to detector resolution effects 
only, while the positive tag rate has an additional 
contribution from real heavy flavor in the jets. Under 
this assumption, the mistag rate is equal to the nega- 
tive tag rate, although in reality there is also a small 
contribution from heavy flavor jets to the negative 
tag rate and there are contributions from K's, A's 
and nuclear interactions with the detector material 
to the positive tag rate. These effects are considered 
later in Section IV B 1. 
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FIG. 5: Efficiency to tag a heavy flavor jet as a function of corrected jet Et in data and Monte Carlo simulation for 
1% (left) and 5% (right) jet probability cut values. 
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FIG. 6: The scale factor (SF) as a function of corrected jet Et for 1% (left) and 5% (right) jet probability cut values. 



TABLE V: Summary of the scale factor vs. Et slope measurements in various samples. 



Sample 


Pj < 1% 


Pj < 5% 


Inclusive Electron 


-0.0082 ± 0.0037 


-0.0081 ± 0.0044 


Jet 20 


-0.0008 ± 0.0019 


-0.0028 ± 0.0024 


Jet 50 


0.0005 ± 0.0008 


0.0005 ± 0.0009 


Weighted Average 


-0.00002 ± 0.00070 


-0.00020 ± 0.00072 



Since the tag rate has a considerable dependence on tag rate matrix, or look-up table, of the transverse 
jet kinematics, it is parameterized as a 6 dimensional energy Et of the jet, the number of jet probability 
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tracks in the jet N tr k, the sum of the transverse e- 
nergy of all jets in the event Y E 3 T et , the rj of the jet 
computed with respect to the center of the detector, 
the z vertex position z vtx and the <f> of the jet. The 
tag rates are obtained from four inclusive jet samples 
selected by requiring the Et of the most energetic jet 
in the event to be greater than 20, 50, 70 or 100 GeV 
respectively. For a 1% (5%) Pj cut, the overall ne- 
gative tag rate is (1.22 ± 0.08)% ((5.30 ± 0.25)%), 
while the overall positive tag rate is (3.54 ± 0.18)% 
((9.20 ± 0.26)%). Overall tag rates depend on the 
sample, which is why the tag rates are parameterized 
as a function of different variables. Figure 7 shows 
the negative tag rates for Pj < 1% and Pj < 5% as 
a function of the jet Et and pseudo-rapidity. The 
bands represent the total uncertainty. 
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We estimate the systematic uncertainties by com- 
paring the observed and predicted tag rates in diffe- 
rent data samples. We apply tag rate matrices, con- 
structed using different inclusive jet subsamplcs, to 
different datasets. Results are shown in Table VI for 
a Pj cut of 1%. The largest deviation between ob- 
served and predicted tag rates across the different jet 
samples is taken as the systematic uncertainty due 
to the sample dependence of the matrix. In order to 
account for any possible bias due to the trigger selec- 
tion, we apply the matrix separately to trigger and 
non-trigger jets. A trigger jet is defined as the jet 
closest in r]-<f) space to the level 2 cluster that fired 
the trigger. We also apply a matrix built with a high 
statistics sample of Jet20 events, to the Jet50 sam- 
ple which has several jets below the trigger threshold 
of 50 GeV. Also considered is a sample selected by 
requiring at least four jets with Et > 15 GeV and 

E T et > 125 GeV. These events are expected to 
give a reasonable estimate of the systematic uncer- 
tainty because of the higher jet and track multiplici- 
ties. Furthermore, this sample is not used to build the 
matrix, making it sensitive to any additional sources 
of systematic uncertainty. Figure 8 compares the ob- 
served and predicted tag rates in the Y E T et sample 
as a function of jet Et- The total systematic un- 
certainty on the overall tag rates is conservatively 
taken as the sum in quadrature of the Et > 3et20 
to Jet50, and the largest of the trigger and sample 
contributions. Table VII summarizes the relative un- 
certainties on the overall tag rates for Pj cuts of 1% 
and 5%. The total relative uncertainty is 5.0% (2.8%) 
for positive tag rate and 6.7% (4.7%) for negative tag 
rate for a Pj cut of 1% (5%). 
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FIG. 7: Mistag rate for jet probability cuts of 1% and 
5% as a function of jet Et (top) and jet pseudo-rapidity 
(bottom) in inclusive jet data sample. The bands repre- 
sent the statistical and systematic uncertainties added in 
quadrature. 



1. Mistag Asymmetry 

The rate of negatively tagged jets does not reflect 
the rate of positive mistags of light jets because of 
residual lifetime effects from A's and K's or inter- 
actions with the detector material. Corrections for 
these effects are determined by studying the flavor 
composition of tagged jets in data. 

The set of jet probability tracks inside a tagged 
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TABLE VI: Ratios of observed to predicted rates of positive and negative tags when a Pj cut of 1% is applied. The 
first column specifies the sample used to build the matrix, while the second column reports the sample used to compute 
the rates. All(even) [odd] means that all(only even event number) [only odd event number] events are used. The errors 
shown are statistical only. 



Matrix 


Sample 


Obs./Pred. Pos. Tag Rate Ratio 


Obs./Prcd. Neg. Tag Rate Ratio 


Inc. Jet Even 


Inc. Jet Odd 


0.997±0.002 


0.999±0.003 


Inc. Jet Even 


Jet20 Odd 


0.987±0.003 


0.970±0.006 


Inc. Jet Even 


Jet50 Odd 


0.991±0.003 


0.998±0.006 


Inc. Jet Even 


Jet70 Odd 


0.997±0.004 


0.996±0.006 


Inc. Jet Even 


JetlOO Odd 


0.989±0.003 


1.029±0.005 


Jet20 All 


Jet50 All 


1.020±0.003 


1.044±0.008 


Inc. Jet Even 


Trig. Jet Odd 


0.976±0.002 


0.978±0.004 


Inc. Jet Even 


Non trig. Jet Odd 


1.028±0.003 


1.028±0.005 


Inc. Jet All 


£ E ] T et All 


1.037±0.002 


0.966±0.003 



TABLE VII: Total relative uncertainties on the overall positive and negative tag rates. Different tag rate matrices are 
applied to orthogonal samples of jets as shown in Table VI, and the total uncertainty is taken as the sum in quadrature 
of the most relevant contributions. Sample refers to the largest uncertainty from lines 2 to 5 in Table VI. 



Pj CUt 


Statistical 


Trigger 




Jet20^Jct50 


Sample 


TOTAL 


POS. Pj < 1% 


0.11% 


2.4% 


3.7% 


2.0% 


1.3% 


5.0% 


Neg. Pj < 1% 


0.25% 


2.2% 


3.4% 


4.4% 


3.0% 


6.7% 


Pos. Pj < 5% 


0.07% 


1.5% 


1.2% 


1.6% 


1.2% 


2.8% 


Neg. Pj < 5% 


0.09% 


1.3% 


2.4% 


3.1% 


2.2% 


4.7% 



jet is used to build a variable sensitive to the flavor 
content of the jet itself. The relative contributions 
from heavy and light partons to data are determined 
by fitting the distribution of this variable for tagged 
jets in data to Monte Carlo simulation templates for 
b, c and light jets. For data, a sample selected by 
requiring a jet with Et > 50 GeV at the trigger level 
is used. For Monte Carlo simulation distributions, 
HERWIG is used to generate 2 — > 2 processes with an 
outgoing parton px > 40 GeV/c. We perform the 
fit using six different variables, the maximum impact 
parameter do of the tracks in the jet, the maximum 
impact parameter significance Sd of the tracks in 
the jet, the mass of the system of tracks with |c?o| > 
0.01 cm and Sd > 2, and the transverse momentum 
(P?p el ) with respect to the jet direction of the system 
of tracks with |do| > 0-01 cm and Sd > 2. 

The fit is made more robust by fitting the posi- 
tive excess only, for which the distributions for nega- 
tive tags are subtracted from the positive side. This 
removes contribution to the mistags due to detec- 
tor resolution, which could be simulated poorly. The 
number of negative tags obtained for b, c and light 
jets in Monte Carlo simulation is normalized to the 
total number of negative tags found in data. From 



the fit, the fractions of b, c and light jets in data are 
obtained; thus the ratio of positive to negative tags 
from light jets, (3. 

Figure 9 shows the result of the fit of the positive 
tag excess in data to Monte Carlo templates of the 
maximum impact parameter of the tracks contained 
within b, c and light tagged jets. A Pj cut of 1% 
is used. It should be noted that the c/b ratio gets 
a contribution from the c/b tagging efficiency ratio 
of about 0.2. The observed rise of light jets is the 
result of the fact that tags for light jets are usually 
due to one large impact parameter track. Table VIII 
summarizes the results of the mistag asymmetry mea- 
surement with the six variables chosen for Pj < 1% 
and 5%. As a final estimate of the mistag asymmetry, 
we take the average of the six measurements and as- 
sign the maximum difference between the average and 
each single determination as the uncertainty. The re- 
sults are 1.56 ± 0.14 and 1.27 ± 0.17 for Pj cuts of 
1% and 5%, respectively. The asymmetry is caused 
by secondary interactions with the detector material 
and residual lifetime effects from K 's and A's, giving 
an excess of positive mistags. We study the expected 
contribution of K's and A's decays to the mistag 
asymmetry in Monte Carlo simulated events. We find 
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4 (GeV) 

FIG. 8: Observed positive (top) and negative (bottom) 
tag rates as a function of Et for events in Ft data vs. 
prediction from the matrix built with events in inclusive 
jet data. The two curves correspond to Pj cuts of 1% 
and 5%. 
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FIG. 9: Result of the fit of the positive tag excess in Jet50 
data as a function of the maximum impact parameter do 
of jet probability tracks inside the tagged jets. A Pj cut 
of 1% is used. Observed data are the crosses, while the 
fitted component from b, c and light jets are the squares, 
triangles and diamonds, respectively. The circles are the 
sum of the three fitted components. 



the ratio of positively to negatively tagged light jets 
to be 1.55±0.11 (1.21±0.04) for a Pj cut of 1% (5%). 
Uncertainties are statistical only. These results are in 
good agreement with our measurements on data and 
suggest K's and A's to be the main source of mistag 
asymmetry. The negative tag rates measured have 
therefore to be scaled up by the asymmetry factor 
in order to obtain an accurate estimate of the posi- 
tive mistag rate. We repeat the measurement in bins 
of jet transverse energy to study the dependence of 
the mistag asymmetry on the jet Et- Results are 
shown in Fig. 10. The asymmetry exhibits a small 
dependence with jet Et which is taken into account 
to estimate the mistag background. 
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FIG. 10: Mistag asymmetry as a function of the jet trans- 
verse energy for Pj < 1% (left plot) and Pj < 5% (right 
plot). 



C. Jet Probability Performance on tt Events 

We study the performance of the jet probability al- 
gorithm by computing the efficiency to tag a b jet in 
PYTHIA Monte Carlo tt events generated with a top 
mass =178 GeV/c 2 . Results are shown in Fig. 11 
as a function of the transverse energy Et and of 
the pseudo-rapidity -q of the jets for Pj cuts of 1% 
and 5%. Jets are matched to b quarks (by requir- 
ing AR < 0.4 between the reconstructed jet and the 
b quark) and the tagging SF is applied to the resul- 
ting efficiency. We also measure the average efficiency 



to tag a b or a c jet in tt events passing the kine- 
matic event selection described in Section V. Results 
are shown in Table IX before and after applying the 
tagging SF. The scaled per-jet efficiencies, together 
with the mistag matrix, are used to determine the ef- 
ficiency to tag at least n jets per event, as described 
in Section VII. Although the tagging requirement 
results in some loss of efficiency for the tt signal, it 
significantly increases the signal-to-background ratio 
by heavily suppressing the dominant IF+jets back- 
ground. 
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TABLE VIII: Mistag asymmetry measured in Jet50 data for Pj cuts at 1% and 5%. The quoted uncertainties are 
derived from the uncertainty in the fits. The uncertainty used for the average is the maximum difference between the 
average and each measurement. 

Fitted variable g [Pj < 1%) g (Pj < 5%) 



Maximum <i 


1.64 ± 0.02 


1.37 ± 0.02 


Maximum Sd a 


1.56 ± 0.03 


1.10 ± 0.02 


Mass of the system of tracks with d > 0.01 cm 


1.51 ± 0.04 


1.30 ± 0.02 


Mass of the system of tracks with Sd a > 2 


1.43 ± 0.03 


1.20 ± 0.02 


P^'- of the system of tracks with \d \ > 0.01 cm 


1.67 ± 0.03 


1.32 ± 0.02 


P?r eL of the system of tracks with Sd > 2 


1.57 ± 0.02 


1.30 ± 0.02 


Average 


1.56 ± 0.14 


1.27 ± 0.17 



TABLE IX: Tagging efficiencies for b and c jets in tt events (m t = 178 GeV/c 2 ) before (raw efficiency) and after (scaled 
efficiency) applying the tagging scale factor. 







b jets 




c jets 




Raw Eff. (%) 


Scaled Eff. (%) 


Raw Eff. (%) 


Scaled Eff. (%) 


Pj < 1% 


43.2 ±0.1 


35.3 ±3.0 


9.6 ±0.2 


7.8 ±0.7 


Pj < 5% 


54.6 ±0.1 


46.5 ±3.9 


20.3 ±0.2 


17.3 ± 1.5 



V. EVENT SELECTION 

Top quark events in the lepton+jcts channel are 
characterized by the presence of an electron or muon 
with high transverse energy, large missing transverse 
energy and four high energy jets, two of which are 
b jets. The basic pretag selection requires one tight 
electron or muon, 1$ T > 20 GeV and jets with cor- 
rected E T > 15 GeV and \r)\ < 2. 

In order to select a lepton+jets sample com- 
pletely disjoint from the top dilepton sample (tt — > 
l + l~vvqq), we reject events with an extra lepton that 
passes the loose requirements. Events consistent with 
Z — ► l + l~ are removed if a tight lepton and a second 
object form an invariant mass within the range [76, 
106] GeV/c 2 . If the tight lepton is an electron, the 
second object may be an isolated electromagnetic ob- 
ject, a jet with electromagnetic fraction greater than 
0.95 or an oppositely-signed isolated track. If the 
tight lepton is a muon, the second object may be an 
isolated muon or an opposite-signed isolated track. 

The event vertex z position is used to cluster jets 
and to ensure leptons and jets come from the same 
interaction. If more than one primary vertex is recon- 
structed in the event, the vertex closest to the lepton 
track is selected as event vertex. Events are rejected 
if the z of the lepton track is farther than 5 cm from 
the z of the event vertex. The vertex z position is 
required to be within 60 cm of the center of the de- 
tector in order to ensure good event reconstruction in 



the projective tower geometry of the CDF detector. 
The efficiency of this requirement is measured using 
minimum bias data and found to be (95.1 ± 0.3)%. 
For consistency with the 6-tagging algorithm, events 
are also rejected if the z of the vertex with highest 
^2 Pt of all tracks is farther than 5 cm from the event 
vertex z. The efficiency of this requirement is (98 ± 
2)%, where the 2% error accounts for the uncertainty 
in the simulation of multiple interactions. 

The events selected by the above criteria are domi- 
nated by QCD production of W bosons in association 
with jets. Figure 12 shows the Pj distribution for 
taggable jets in this sample. In order to improve the 
signal to background ratio for tt events, we require 
at least one jet in the event to be tagged as a b jet. 
A tt event is expected to have four jets in the final 
state, but due to gluon radiation, jet merging, and 
inefficiencies in jet reconstruction, this number can 
eventually be different. We therefore use the tagged 
events with three or more jets to define our signal 
sample, while the events with one and two jets are 
used as a control sample. 



A. Optimized Selection 

The variable Ht, defined as the scalar sum of all 
the transverse energy in the event, i.e., the sum of 
the $ T , the electron Et or muon pt, and the Et 
of the jets, is a measure of the energy in the hard 
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FIG. 11: Efficiency to tag b jets in ti Monte Carlo simu- 
lated events as a function of jet Et (top) and jet r\ (bot- 
tom) for two different Pj cuts. The efficiency is obtained 
by multiplying the tag rate for jets matched to b quarks 
in the Monte Carlo simulation by the appropriate tagging 
scale factor. The bands represent the systematic uncer- 
tainty on the scale factors. 
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FIG. 12: The Pj distribution for taggable jets in the pre- 
tag sample (note that the vertical axis does not start at 
0). The inset shows a zoom of the Pj distribution from 
0.0 to 0.1. 



(top) shows the Mjf distribution for the ti Monte 
Carlo simulation together with the various normal- 
ized background contributions. Figure 14 (bottom) 
shows the statistical significance as a function of the 
Mjf cut. Note that the non-VF background lies at 
lower values of Mjf. In the optimization of the Mjf 
cut, we take S to be the number of events from real 
W bosons and B to be the number of events from 
non-W background. A cut of > 20 GeV/c 2 

gives optimal statistical significance. 



scatter, and is a powerful discriminant between the 
ti pair production signal events (S) and background 
events (B). In order to find the optimal Ht cut, we 
maximize the statistical significance (S/y/S + B) in 
the signal region. Figure 13 (top) shows the Ht dis- 
tribution of the ti Monte Carlo simulation, together 
with the various background contributions, properly 
normalized. Figure 13 (bottom) shows the statisti- 
cal significance as a function of the Ht cut. Details 
about the background estimates and datasets used 
can be found in Section VI. Optimal statistical sig- 
nificance is reached with a cut of Ht > 200 GeV. 

In addition, we enhance the W component 
of the sample by requiring the transverse 
mass of the lepton and the missing energy, 

M^^^J (E T (l) + E T (v)f - (P T (l) + PtW))\ be 
consistent with W boson production. Figure 14 



B. Yields of Events 

Events which pass the selection criteria described 
so far, before applying 6-tagging, form the pretag 
sample. The number of observed events in both the 
pretag and tagged samples for Pj < 1% and Pj < 
5% are summarized in Table X as a function of the 
number of tight jets in the event. 



VI. BACKGROUNDS: EXPECTED 
COMPOSITION OF THE b-TAGGED 
LEPTON+JETS SAMPLE 

Other processes besides ti are expected to con- 
tribute to the tagged lepton+jcts sample. The main 
contribution comes from heavy flavor production 
in association with a W boson (Wbb, Wcc, Wc). 
VF+light flavor production also gives a significant 
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TABLE X: 


Yield of events in 


318 ptr 1 of data for Pj < 1% and Pj 


< 5%. 


Jet Multiplicity 


1 jet 


2 jets 


3 jets 


> 4 jets 


Prctag Events 


CEM 


16897 


2657 


182 


105 


CMUP 


8169 


1175 


83 


44 


CMX 


4273 


610 


35 


17 


Total 


29339 


4442 


300 


166 


Single Tagged Events, Pj < 1% (5%) 


CEM 


207 (571) 


106 (230) 


33 (53) 


36 (53) 


CMUP 


92 (256) 


58 (105) 


13 (24) 


24 (29) 


CMX 


51 (148) 


27 (50) 


6 (10) 


8(11) 


Total 


350 (975) 


191 (385) 


52 (87) 


68 (93) 




Double Tag 


gcd Events, Pj < 1% (5%) 




CEM 




8 (16) 


7(15) 


9 (18) 


CMUP 




3(9) 


4(4) 


8 (17) 


CMX 




2(3) 


1 (3) 


1(4) 


Total 




13 (28) 


12 (22) 


18 (39) 



contribution due to mistagged jets. Smaller contribu- 
tions come from electroweak processes (diboson pro- 
duction, Z — > t + t~ events or single top) and generic 
QCD jet production with misidentified W bosons. 
These backgrounds are described in the following sub- 
sections. 



The total diboson, Z — > t + t and single top pre- 
dictions for Pj < 1% (5%) are shown in Table XIX 
(Table XX) and account for 2.5% (3.0%) of the num- 
ber of events in the signal region of 3 and >4 jets. 
Following the same procedure, we also compute the 
electroweak background contributions to the pretag 
sample. The results are shown in Table XVIII. 



A. Electroweak Processes 

Electroweak processes are studied using Monte 
Carlo simulated samples. Diboson events (WW, WZ 
and ZZ) can contribute to the tagged lepton+jets 
sample when one boson decays leptonically and the 
other decays into heavy quarks. The process Z — > 
t + t~ can also give a contribution due to the leptonic 
decays of the tau. Finally, there is a contribution 
from single top quarks produced in association with a 
b quark through qq annihilation in W* (s-channel) or 
VF-gluon fusion (t-channel), in which an initial gluon 
splits into a bb pair and a b quark interacts with a 
virtual W. 

The number of events from these processes are pre- 
dicted based on their theoretical cross sections [37- 
39] (listed in Table XI), the measured integrated lu- 
minosity, and the acceptances and tagging efficiencies 
derived from Monte Carlo simulations. The expec- 
tations for these backgrounds are corrected for dif- 
ferences between Monte Carlo simulations and data, 
which include the lepton identification scale factor, 
trigger efficiencies, the z vertex cut efficiency and the 
tagging scale factor. 



TABLE XI: Cross sections used to estimate electroweak 
backgrounds. For diboson and single top production, the 
theoretical values are used. For Z — > t + t~, we use the 
cross section measured by CDF. 



Process 



Cross Section (pb) 



WW 

WZ 

ZZ 

Single Top W — g (t-channel) 
Single Top W* (s-channel) 



13.25 ± 0.25 
3.96 ± 0.06 
1.58 ± 0.02 
1.98 ± 0.08 
0.88 ± 0.05 
254.3 ± 5.4 



B. Non-W Background 

The non-VF background consists of events for which 
the lepton+_^ T signature is not due to the decay of 
a W boson. The main contribution to this source of 
background comes from QCD jet production where a 
jet provides the signature of a lepton and the miss- 
ing transverse energy is due to a bad measurement of 
the jet energies. Semileptonic decays of b mesons and 
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FIG. 13: Top: Ht distribution for tagged events with 3 
or more tight jets in tt Monte Carlo simulation (6.1 pb) 
and main backgrounds, for an integrated luminosity of 
318 pb -1 . Bottom: statistical significance as a function 
of the cut applied. 



FIG. 14: Top: distribution for tagged events with 3 

or more tight jets in tt Monte Carlo simulation (6.1 pb) 
and main backgrounds, for an integrated luminosity of 
318 pb~ . Bottom: statistical significance as a function 
of the cut applied. 



misidcntificd photon conversions can also contribute. 
Due to its inherent instrumental nature, this back- 
ground is difficult to estimate. In the event selection, 
its contribution to the lepton+jets sample is mini- 
mized by the requirement on the W boson transverse 
mass MjT. In particular, note that the optimization 
of this cut has been performed by requiring the lepton 
to be non-isolated in order to have an independent 
data sample to construct the kinematical variables 
(we use region C of Fig. 15). 

The method used to estimate the non-W back- 
ground assumes that the isolation of the high-py lep- 
ton and the event I$ T are uncorrelated for QCD pro- 
cesses, so that the ratio of non-VF events with low lep- 
ton isolation to those with high lepton isolation in the 
region at low J} T is the same as in the high I$ T region. 



Four regions in the lepton isolation versus missing 
transverse energy plane are defined (see Fig. 15): 

• Region A: Isolation > 0.2 and $ T < 15 GeV 

• Region B: Isolation < 0.1 and $ T < 15 GeV 

• Region C: Isolation > 0.2 and $ T > 20 GeV 

• Region D: Isolation < 0.1 and $ T > 20 GeV. 

The tt signal is expected to populate region D (sig- 
nal region), while the non-14/ events dominate regions 
A, B and C (sideband regions). We can therefore esti- 
mate the fraction of events in the signal region which 
originate from non-W backgrounds as follows: 

Fnrm—W = TT : . (12) 
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FIG. 15: Definition of the sideband regions used to esti- 
mate the non-VF background. Lepton isolation versus 
missing transverse energy distribution for ti simulated 
events is also shown. 



where Na, N b , Nc and Nd are the total numbers of 
observed events in the four regions. We describe next 
the estimate of the non-VF events in both the pretag 
and tagged samples. 

1. Fraction of non-W Events in the Pretag Sample 

An estimate of the contribution of the non-VF 
events to the pretag sample is mandatory to correctly 
normalize most of the backgrounds in the tagged sam- 
ple. Table XII summarizes the results for the non-W 
fractions in the pretag sample as a function of the 
jet multiplicity for electrons and muons. Note that 
we do not apply the Ht and Mjf cuts in regions A 
and B to preserve statistics. We correct the yields 
in regions A, B and C by subtracting the expected 
contribution from ti events assuming a t i = 6.1 pb 
(this assumption is found to have a negligible im- 
pact on the final non-W estimate). Uncertainties in 
Table XII are statistical only. The main source of 
systematic uncertainty comes from the lepton isola- 
tion and missing transverse energy not being fully 
uncorrelated for QCD events. We study the effect of 
this assumption by varying the values of the I$ T and 
lepton isolation cuts in the definition of the sideband 
regions. We observe a maximum variation of 50% in 
the resulting non-W fraction, which we assign as a 
systematic uncertainty on our estimates. 

To further cross check the accuracy of the predic- 
tions, we define new intermediate isolation regions B' 
and D': 

• Region B: $ T < 15, Isol< 0.1 — > Region B': 
I/jj, < 15, O.KIsoK 0.2, 



• Region D: $ T > 20, Isol< 0.1 -> Region D': 
Ifirp > 20, 0.1<Isol< 0.2. 

From the intermediate region B', we estimate the 
number of non-W events in region D'. The predicted 
non-PF fractions are shown in Table XIII. The uncer- 
tainties quoted are statistical only. In the same table, 
these fractions are compared with the expected non- 
W fractions computed from the difference between 
the observed events and the contributions from ti and 
FF+jets events. The expected number of ti events is 
derived by normalizing the Monte Carlo prediction 
to a cross section of 6.1 pb. In order to estimate 
the H^+jets contribution in region D', we compute 
the ratio of ly+jets events in the regions D' and D 
using simulations and normalize the expectations for 
IF+jets production in D' to the number of events in 
the signal region after removing ti, electroweak con- 
tributions. We compute the relative differences as 
the ratio of the difference between expected and pre- 
dicted non-VF fractions to the predicted fraction. For 
each jet multiplicity bin, the differences between pre- 
dicted and expected non-VF fractions in region D' are 
consistent with the 50% uncertainty we derived vary- 
ing the If T and lepton isolation cuts in the definition 
of the sideband regions. 

2. Non- W Events in the Tagged Sample 

The non-W background contributes to the tagged 
sample through both real heavy flavor production (bb 
and cc events) and mistags. We compute the number 
of non-W events with tagged jets in the signal region 
using equation 12 with the numbers of tagged events 
in the sideband regions. Yields in regions A, B and 
C are corrected for ti contributions. The results are 
summarized in Table XIV: {N B / N A ) ta sa ed is the ra- 
tio of tagged events in regions B and A and it is used 
to normalize the number N^" ed f tagged events in 
the region C to get the expected number N^^^oi 
non-W events on the signal region. The precision of 
these estimates is limited by the number of tagged 
events in the sideband regions. 

We cross check these results by estimating the non- 
W contribution to the tagged lepton+jets sample 
following two alternative methods. In the first one 
{check 1), we assume the tag rate in region D to be 
the same as in region B: 

N non-W = F non-W y-N D x e B , (13) 

where e B is the event tag rate in region B and Njj 
is the number of events in region D. This method 
has a large systematic uncertainty since the tag rate 



23 



TABLE XII: Number of events in the sideband regions and fraction of non-W events in the signal region before and 
after correcting for ti contribution. Quoted errors are statistical only. 



Jet Multiplicity 


1 jet 


2 jets 


3 jets 


> 4 jets 


Prctag Electrons 


Region A 


100600 


12756 


1745 


216 


Region B 


61818 


5228 


593 


98 


Region C 


1651 


428 


27 


15 


Region D 


16897 


2657 


182 


105 


T^uncorr. 
non—W 


0.060 ± 0.002 


0.066 ± 0.004 


0.05 ± 0.01 


0.06 ± 0.02 


F non — W 


0.060 ± 0.002 


0.066 ± 0.004 


0.05 ± 0.01 


0.05 ± 0.02 


Pretag Muons 


Region A 


36599 


5248 


657 


97 


Region B 


11718 


968 


114 


21 


Region C 


737 


181 


12 


11 


Region D 


12442 


1785 


118 


61 


T^uncorr. 
non—W 


0.0190 ± 0.0007 


0.019 ± 0.002 


0.018 ± 0.006 


0.04 ± 0.02 


^ non—W 


0.0190 ± 0.0007 


0.019 ± 0.002 


0.014 ± 0.005 


0.03 ± 0.01 



TABLE XIII: Predicted and expected fractions of non-VF events in the intermediate region D' for the electron and 
muon samples. Errors are statistical only. 



Jet Multiplicity 


1 jet 


2 jets 


3 jets 


> 4 jets 


Elcctron+Jcts Sample 


Predicted Non-VF Fraction 


0.82 ± 0.03 


0.60 ± 0.05 


0.53 ± 0.16 


0.49 ± 0.21 


Expected Non-W Fraction 


0.78 ± 0.01 


0.82 ± 0.02 


0.76 ± 0.10 


0.59 ± 0.16 


Fractional Relative Difference 


-0.05 


0.37 


0.42 


0.20 




Muon+ 


Jets Sample 






Predicted Non-VF Fraction 


0.41 ± 0.03 


0.41 ± 0.06 


0.40 ± 0.23 


0.40 ± 0.33 


Expected Non-TF Fraction 


0.70 ± 0.02 


0.62 ± 0.05 


0.44 ± 0.22 


0.26 ± 0.25 


Fractional Relative Difference 


0.70 


0.50 


0.11 


-0.35 



could depend on the missing transverse energy due 
to the contribution of 66 events with a b quark decay- 
ing into leptons. Events with large I$ T would have a 
larger heavy flavor contribution due to real neutrino 
production from semileptonic 6 decay. In the second 
alternative method (check 2), we compute the tag- 
ging rates per jet in the sideband regions, and then 
we predict the tag rate per jet in the signal region D 
as 



Tag Rate B x Tag Rate C 
Tag Rate A 



Pred Tag Rate D 

i ail unit . i 

We compute the jet tagging rate by assuming it to be 
the same in all the jet multiplicity bins and use this 
estimate to predict the non-W background in the sig- 
nal region taking into account the jet multiplicity and 
the number of non-14 7 events expected in the pretag 



lepton+jets sample. Table XV compares the non-W 
contributions predicted by the three methods. 

Finally, we use the results of the two alternative 
estimates to assign a systematic uncertainty of 50% 
which takes into account the differences with the base 
method. The total non-W background accounts for 
1.2% of the observed events with tagged jets in the 
signal region, both for Pj < 1% and Pj < 5%. 



C. W + Heavy Flavor Processes 

TF+heavy flavor production is the main source of 
background in the tagged lepton+jets sample. It is 
estimated using the heavy flavor fractions in W bo- 
son production in association with partons and the 
tagging efficiency for these processes. These quanti- 
ties are derived from Monte Carlo simulations. The 
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TABLE XIV: The number of non-W events in the signal region D estimated from the corrected numbers of tagged 
events in the sideband regions with equation 12. Uncertainties are statistical only. 



Jet Multiplicity 


1 jet 


2 jets 


3 jets 


> 4 jets 




Elcctron+Jcts Sample (Pj < 15 


o) 




(N B /N A ) ta 39 ed 


0.36 ± 0.01 


0.26± 0.02 


0.26 ± 0.05 


0.4 ± 0.2 


^tagged 


74.8 


25.1 


1.8 


1.0 


i\jtagged,e 
^non-W 


26.7 ± 3.3 


6.6 ± 1.4 


0.5 ± 0.4 


0.4 ± 0.4 


Muon+Jets Sample (Pj < 1%) 


(N B /N A y ag9eci 


0.102 ± 0.008 


0.10 ± 0.02 


0.11 ± 0.04 


0.2 ± 0.1 


^tagged 


36.9 


20.3 


4.0 


0.81 


]\jtagged,fj, 
J1 non-W 


3.8 ± 0.7 


2.0 ± 0.6 


0.5 ± 0.3 


0.2 ± 0.1 




Electron-i 


-Jets Sample (Pj < 59 


o) 




(N B /N A y a99ed 


0.42 ± 0.01 


0.33 ± 0.02 


0.29 ± 0.04 


0.5 ± 0.1 


^tagged 


142.8 


52.9 


3.5 


1.5 


AT tagged,e 


59.6 ± 5.2 


17.6 ± 2.6 


1.0 ± 0.5 


0.7 ± 0.6 


Muon+Jets Sample (Pj < 5%) 


(N B /N A ) ta 93 ed 


0.141 ± 0.007 


0.12 ± 0.01 


0.09 ± 0.03 


0.17 ± 0.07 


^tagged 


65.8 


32.1 


3.7 


0.6 


]\jtagged,fj, 
ly non-W 


9.3 ± 1.2 


3.8 ± 0.8 


0.3 ± 0.2 


0.1 ± 0.1 



TABLE XV: Number of non-W events expected in the tagged lepton+jets sample as a function of the jet multiplicity 
for the three methods described. Uncertainties are statistical only. 



Jet Multiplicity 


1 jet 


2 jets 


3 jets 


> 4 jets 


Pj < 1% 


AT tagged 

ly non-W 
pjtagged, check 1 

ly non-W 
jijtagged, check 2 

JV non-W 


30.5 ± 3.3 
19.0 ± 0.7 
27.7 ± 2.6 


8.6 ± 1.5 

6.7 ± 0.6 
9.3 ± 0.9 


0.9 ± 0.5 
0.6 ± 0.1 
0.7 ± 0.1 


0.5 ± 0.4 
0.6 ± 0.3 
0.6 ± 0.2 


Pj < 5% 


AT tagged 
ly non-W 
Tsjtagged, check 1 
ly non-W 
jijtagged, check 2 
Iy non-W 


68.8 ± 5.4 
43.5 ± 1.3 
65.4 ± 4.4 


21.4 ± 2.8 
16.2 ± 1.0 
21.9 ± 1.8 


1.3 ± 0.6 
1.1 ± 0.2 
1.6 ± 0.3 


0.8 ± 0.6 
1.6 ± 0.5 
1.5 ± 0.4 



overall normalization is obtained from the number of 
observed events in the pretag sample. 

The estimate of the heavy flavor fraction in ly+jets 
events is described elsewhere [7] . We use the ALPGEN 
event generator, which is able to compute exact ma- 
trix clement calculations at leading order for parton 
level QCD and electroweak processes. We can there- 
fore compute the ratio between the VF+heavy flavor 
production cross section and the inclusive ly+jets 
cross section since it is expected to be stable in 
the transition from leading-order to next-to-leading- 
order matrix elements. We generate events where 
inclusive W, Wbb, Wcc and Wc are produced in as- 



sociation with n light partons. Parton level events 
from ALPGEN are fed to the HERWIG parton shower 
program which generates additional jets from gluon 
radiation, and a full CDF detector simulation is ap- 
plied. Events containing a different number n of light 
partons are combined following a rigorous prescrip- 
tion in order to avoid double counting due to parton 
shower radiation, which causes W + n parton events 
to populate part of the phase space described by the 
W+(n + 1) parton sample. The Wbb and Wcc sam- 
ples are further divided into two classes according to 
the number of reconstructed heavy flavor jets in the 
event. We refer to these classes as IB and 2B (1C 



25 



and 2C) for Wbb (Wcc). By means of these combined 
Monte Carlo simulated samples, the heavy flavor frac- 
tions for VF+jets events are measured as the ratio 
between the computed IF+heavy flavor and W+jets 
cross sections. Jet data samples are used to correct 
for residual discrepancies between data and Monte 
Carlo simulations: a factor 1.5 ± 0.4 is applied to the 
Wbb and Wcc fractions [7], where the uncertainty is 
dominated by the systematic uncertainties associated 
with the ALPGEN heavy flavor calculations. The final 
heavy flavor fractions are shown in Table XVI. 

The contribution of VF+heavy flavor production to 
the pretag lepton+jets sample is estimated by multi- 
plying heavy flavor fractions by the observed number 
of events in the pretag sample, corrected for the non- 
W, and electroweak background expectations. The 
results are shown in Table XVIII. 

The above Monte Carlo simulated samples after 
pretag selection are used to compute the tagging ef- 
ficiencies. In order to avoid double counting of the 
mistag background, the jet probability algorithm is 
applied only to jets known to be due to a b or c quark. 
Each tagged jet is weighted according to the scale 
factor. Results are summarized in Table XVII. The 
systematic uncertainties are dominated by the uncer- 
tainties on the scale factor for b and c jets. The pretag 
expectations are multiplied by the tagging efficiencies 
to estimate the contributions of these processes to the 
tagged sample. The numbers of Wbb, Wcc and Wc 
events expected in the tagged lepton+jets sample for 
Pj < 1% (5%) are shown, along with the rest of the 
backgrounds, in Table XIX (Table XX) and account 
for 12.3% (13.2%) of the observed number of events 
in the signal region. 



D. Mistag Background 

Events in which jets from light partons are tagged 
as heavy flavor jets can contribute to the tagged sam- 
ple. The number of events with negative tags in the 
pretag sample would be a simple estimate of this 
background, but this method has the problem of a 
large statistical uncertainty. Instead, we count the 
events in the pretag sample weighted by their proba- 
bility to have at least one mistagged jet. This proba- 
bility is computed by applying the negative tag rate 
matrix to all the taggable jets in the event. 

This estimate is corrected for the mistag asymme- 
try derived in Section IV Bl. In order to take into 
account the dependence of the mistag asymmetry on 
the jet Et, we convolute it with the jet Et spectra 
in events with VT+three or more jets for data and tt 
Monte Carlo simulation, as shown in Fig. 16. The ob- 



served difference between the means of the distribu- 
tions and the mistag asymmetry values measured in 
Section IV B 1 are negligible within the uncertainties. 
We therefore decide to use the former in our analy- 
sis. The RMS of the distributions gives an estimate 
of how much the asymmetry changes over the jets in 
our samples, and it is taken as an additional uncer- 
tainty on the mistag asymmetry. The final mistag 
asymmetry scale factors are 1.56 ± 0.17 for Pj < 1% 
and 1.27 ± 0.20 for Pj < 5%. 




1.5 1.6 1.7 1.8 1.9 2 

Mistag Asymmetry 



1 1.2 1.3 1.4 1.5 1.6 1.7 

Mistag Asymmetry 



FIG. 16: Mistag asymmetry distribution for jets in data 
(dots) and tt Monte Carlo simulated events (histogram) 
in the signal region for Pj < 1% (5%) in the left (right) 
side. 



The estimate of the mistag background is also 
scaled down by one minus the fraction of pretag 
events which are due to non-IF, and electroweak 
backgrounds. The contribution of the mistag back- 
ground to the lepton+jets sample when a jet with 
Pj < 1% (5%) is required is shown in Table XIX 
(Table XX) and accounts for 12.8% (27.9%) of the 
observed number of events in the signal region. 



1. Mistag Cross Check 

The negative tag rate matrix has been extensively 
tested on inclusive jet samples. Results are discussed 
in Section IV B. The mistag matrix is found to 
correctly predict the number of events with nega- 
tively tagged jets observed in independent samples 
to within a few percent. To further test the mistag 
matrix reliability on lepton+jets data, we select a 
subsample of events by requiring $ T < 20 GeV. This 
sample is expected to be dominated by QCD jet pro- 
duction with the high-pT lepton signature provided 
by a jet. Figure 17 compares the observed number of 
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TABLE XVI: Summary of Wbb, Wcc and Wc fractions. IB and 2B (1C and 2C) indicate the Wbb (Wcc) events with 
one and two 6-jets (c-jets) reconstructed, respectively. Uncertainties are statistical only. 



Jet Multiplicity 


1 jet 


2 jets 


3 jets 


> 4 jets 


IB 


0.010 ± 0.003 


0.014 ± 0.004 


0.024 ± 0.006 


0.022 ± 0.006 


2B 




0.014 ± 0.004 


0.023 ± 0.006 


0.026 ± 0.007 


1C 


0.016 ± 0.004 


0.024 ± 0.006 


0.038 ± 0.010 


0.035 ± 0.010 


2C 




0.018 ± 0.005 


0.029 ± 0.008 


0.037 ± 0.010 


Wc 


0.043 ± 0.009 


0.060 ± 0.013 


0.060 ± 0.013 


0.059 ± 0.013 



TABLE XVII: Jet probability tagging efficiencies for Wbb, Wcc and Wc events. The first uncertainty is statistical, 
while the second is systematic. 

Jet Multiplicity 1 jet 2 jet 3 jet > 4 jets 

Event Tagging Efficiencies (%), Pj < 1% 

lB(>ltag) 29.5 ± 0.3 ± 2.5 30.7 ± 0.6 ± 2.6 37.0 ± 1.5 ± 3.2 33.5 ± 3.2 ± 2.9 

2B (>1 tag) 50.5 ± 0.7 ± 4.3 56.0 ± 1.6 ± 4.8 54.6 ± 2.2 ± 4.7 

1C (>1 tag) 6.8 ± 0.2 ± 0.9 7.8 ± 0.4 ± 1.0 9.1 ± 1.0 ± 1.2 8.0 ± 1.9 ± 1.0 

2C (>1 tag) 13.5 ± 0.6 ± 1.7 16.8 ± 1.5 ± 2.2 14.3 ± 1.7 ± 1.8 

Wc (>1 tag) 7.2 ± 0.2 ± 0.9 7.9 ± 0.3 ± 1.0 8.3 ± 0.9 ± 1.1 6.9 ± 1.1 ± 0.9 

Event Tagging Efficiencies (%), Pj < 5% 

IB (>1 tag) 39.6 ± 0.3 ± 3.4 40.8 ± 0.6 ± 3.4 47.1 ± 1.6 ± 4.0 41.4 ± 3.4 ± 3.5 

2B (>1 tag) 63.5 ±0.7 ± 5.4 68.4 ± 1.5 ± 5.8 66.7 ± 2.1 ± 5.6 

1C (>1 tag) 14.8 ± 0.3 ± 1.9 17.2 ± 0.5 ± 2.2 18.7 ± 1.4 ± 2.4 16.3 ± 2.6 ± 2.1 

2C (>1 tag) 27.1 ± 0.8 ± 3.4 33.2 ± 1.9 ± 4.2 32.3 ± 2.3 ± 4.1 

Wc (>1 tag) 15.3 ± 0.3 ± 1.9 16.4 ± 0.5 ± 2.1 19.4 ± 1.3 ± 2.5 18.6 ± 1.7 ± 2.4 



events with negative tags and the matrix prediction 
as a function of the jet multiplicity. Good agreement 
is observed confirming the reliability of the mistag 
matrix to predict the negative tag rate in events dom- 
inated by prompt jets. We repeat the test on the 
pretag lepton+jets sample, where a I$ T > 20 GeV 
requirement is applied. Results are shown in Fig. 18. 
We observe a discrepancy between observed and pre- 
dicted negative tags which we attribute to the higher 
fraction of heavy flavor in lepton+jets events with 
high value of I$ T with respect to the inclusive jet sam- 
ples where the matrix has been computed. To corrob- 
orate this hypothesis, we make a first-order correc- 
tion to the mistag prediction by using the heavy fla- 
vor fractions (fx) in VF±jets events (see Table XVI). 
We compute the negative tag rates for light (Mi) and 
heavy (Mh) flavor jets in ti Monte Carlo simulation. 
For each jet multiplicity bin, a scale factor R is then 
determined as: 

R = (l-£/r) + £/rCT, (15) 
T = IB, 2B, 1C, 2C, Wc, (16) 



where 

i-(i-M t y- k (i-M h ) k 
Ct = 1 - (i - Ml y • (17) 

The numbers j and k in the formula are the jet mul- 
tiplicity and the number of heavy flavor jets (k = 1 
for T = IB, 1C or Wc and k = 2 for T = 2B or 2C), 
respectively. The corrected distributions, also shown 
in Fig. 18, show a much better agreement with the 
observed rates of negative tagged events. We there- 
fore use the mistag matrix prediction corrected by 
the mistag asymmetry as an estimate of the number 
of events with a tagged light jet. 



E. Background Summary 

Table XVIII summarizes the contributions of the 
different background estimates in the pretag sample. 
The difference between the observed number of events 
and the total background estimate is due to W^+light 
flavor and ti contributions. 

Table XIX and Table XX summarize the contri- 
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TABLE XVIII: Summary of the background estimate in the pretag sample. The difference between the total background 
estimate and the observed number of events is due to W+light flavor and ti contributions. 



Jet Multiplicity 


1 jet 


2 jets 


3 jets 


> 4 jets 


Elcctroweak 


WW 


127 ± 9 


123 ± 9 


10.0 ± 0.8 


3.6 ± 0.3 


WZ 


16.8 ± 1.2 


18.8 ± 1.4 


1.7 ± 0.1 


0.61 ± 0.06 


ZZ 


0.67 ± 0.05 


0.68 ± 0.05 


0.14 ± 0.02 


0.052 ± 0.008 


Single Top W — g 


13.4 ± 1.1 


13.6 ± 1.1 


1.44 ± 0.12 


0.38 ± 0.04 


Single Top W* 


4.0 ± 0.4 


7.9 ± 0.7 


1.02 ± 0.10 


0.24 ± 0.02 


Z -» T+T- 


87 ± 7 


16.5 ± 1.7 


1.0 ± 0.3 


± 


Total 


249 ± 18 


180 ± 13 


15.3 ± 1.2 


4.9 ± 0.4 




W 


+ Heavy Flavor 






Wbb 


281 ± 75 


116 ± 31 


12.9 ± 3.3 


7.4 ± 2.0 


Wcc 


459 ± 123 


170 ± 46 


18.4 ± 5.0 


11.1 ± 3.1 


Wc 


1197 ± 252 


243 ± 53 


16.9 ± 3.6 


9.1 ± 2.0 


Total 


1938 ± 322 


530 ± 94 


47.8 ± 9.0 


27.5 ± 5.5 


Others 


Non-W 


1250 ± 626 


208 ± 104 


10.0 ± 5.3 


7.3 ± 4.1 


Total Background 


3436 ± 741 


917 ± 150 


73 ± 11 


39.7 ± 7.2 


Data 


29339 


4442 


300 


166 



butions of the different background sources in the 
tagged lepton+jets sample for Pj < 1% and Pj < 
5% respectively. 

We observe good agreement between data and 
background predictions in events with one and two 
jets, which supports the validity of our background 
estimates. In events with three or more jets, we 
observe an excess of tagged events in data which 
we attribute to ti events. The estimates of the 
W+heavy flavor and mistag background contribu- 
tions have been normalized to the data in the pretag 
sample assuming no signal. Having actually observed 
a significant number of ti events in the tagged sam- 
ple, we need to correct those estimates by the num- 
ber of signal events in the pretag sample. We make 
this correction through an iterative procedure which 
is described in Section VIII. 



VII. SIGNAL ACCEPTANCE 

The signal acceptance, or ti event detection effi- 
ciency, is defined as the fraction of ti events that 
satisfy all selection requirements, and includes trig- 
ger and reconstruction efficiencies as well as the ef- 
ficiencies of the kinematic selection and of the b- 
tagging algorithm. We measure it using a PYTHIA 
ti Monte Carlo sample generated with a top quark 
mass m t = 178 GeV/c 2 and simulated as discussed 



in Section III. Wherever possible, effects which are 
not sufficiently well modeled in the simulation are 
measured using data. The acceptance is defined as 

e ti — (Ati X Klep X i-trig X £z X e zvtx ) X €b-tag 

= <%* tag x e b - tag (18) 

where A t i is the fraction of Monte Carlo simulated 
ti events which pass the kinematic requirements (ex- 
cept fe-tagging) and includes the branching fraction 
for ti — > e/n+jets, the lepton identification efficiency 
(including isolation and cosmic/conversion veto effi- 
ciency, as described in Section III), the dilepton and 
Z° — > l + l~ veto efficiencies, and the kinematic and 
geometric acceptances. A t i is measured separately 
for electron and muon events. Ki ep is a scale factor 
which takes into account the difference in lepton iden- 
tification efficiency between data and Monte Carlo 
simulations estimated using Z — ► l + l~ events; e tr i g 
is the trigger efficiency for identifying high px lep- 
tons and is measured using data from independent 
triggers. Both Ki ep and e tr i g are discussed in Sec- 
tion III. e zo and e zvtx ar e the efficiencies for the z 
vertex cuts described in Section V and tb-tag is the 
efficiency to tag at least one tight jet in a ti event 
and includes a tagging scale factor to account for dif- 
ferences between Monte Carlo simulations and data. 

The event tagging efficiency eb-tag is obtained from 
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TABLE XIX: Summary of the background estimate in the lepton+jets sample when a jet with Pj < 1% is required. 



Jet Multiplicity 


1 jet 


2 jets 


3 jets 


> 4 jets 


Elcctroweak 


WW 


2.2 ± 0.3 


5.0 ± 0.6 


0.7 ± 0.1 


0.28 ± 0.06 


WZ 


1.0 ± 0.1 


2.0 ± 0.2 


0.23 ± 0.03 


0.09 ± 0.02 


ZZ 


0.027 ± 0.006 


0.09 ± 0.01 


0.012 ± 0.004 


0.007 ± 0.002 


Single Top W-g 


4.1 ± 0.5 


4.9 ± 0.6 


0.73± 0.08 


0.20 ± 0.02 


Single Top W* 


1.3 ± 0.2 


4.2 ± 0.5 


0.60 ± 0.07 


0.14 ± 0.02 


Z -» T+T- 


0.7 ± 0.3 


0.4 ± 0.2 


0.04 ± 0.04 


± 


Total 


9.3 ± 1.1 


16.6 ± 1.8 


2.3 ± 0.3 


0.71 ± 0.09 


W + Heavy Flavor 


Wbb 


83 ± 23 


47 ± 13 


6.0 ± 1.6 


3.3 ± 0.9 


Wcc 


31 ± 9 


17.5 ± 5.2 


2.3 ± 0.7 


1.2 ± 0.4 


Wc 


86 ± 21 


19.2 ± 5.0 


1.4 ± 0.4 


0.6 ± 0.2 


Total 


200 ± 42 


84 ± 20 


9.6 ± 2.4 


5.2 ± 1.4 


Others 


Mistag 


149 ± 17 


51.8 ± 5.9 


8.5 ± 1.0 


6.7 ± 0.8 


Non-W 


31 ± 16 


8.6 ± 4.6 


0.9 ± 0.6 


0.5 ± 0.5 


Total Background 


389 ± 49 


161 ± 22 


21.4 ± 2.7 


13.1 ± 1.7 


Data 


350 


191 


52 


68 



TABLE XX: Summary of the background estimate in the lepton+jets sample when a jet with Pj < 5% is required. 



Jet Multiplicity 


1 jet 


2 jets 


3 jets 


> 4 jets 


Elcctroweak 


WW 


5.5 ± 0.6 


12.5 ± 1.4 


1.81 ± 0.21 


0.74 ± 0.10 


WZ 


1.6 ± 0.2 


3.3 ± 0.3 


0.40 ± 0.05 


0.16 ± 0.02 


ZZ 


0.049 ± 0.009 


0.14 ± 0.02 


0.027 ± 0.006 


0.014 ± 0.004 


Single Top W-g 


5.4 ± 0.6 


6.5 ± 0.7 


0.92 ± 0.10 


0.26 ± 0.03 


Single Top W* 


1.7 ± 0.2 


5.2 ± 0.6 


0.74 ± 0.08 


0.17 ± 0.02 


Z -» T+T- 


2.1 ± 0.5 


1.1 ± 0.3 


0.13 ± 0.10 


± 


Total 


16.3 ± 1.8 


28.8 ± 3.0 


4.0 ± 0.4 


1.4 ± 0.1 


W + Heavy Flavor 


Wbb 


111 ± 31 


61 ± 17 


7.4 ± 2.0 


4.1 ± 1.2 


Wcc 


68 ± 20 


36 ± 11 


4.6 ± 1.4 


2.7 ± 0.8 


Wc 


184 ± 45 


40 ± 10 


3.2 ± 0.8 


1.7 ± 0.5 


Total 


363 ± 75 


137 ± 31 


15.2 ± 3.6 


8.5 ± 2.1 


Others 


Mistag 


585 ± 92 


194 ± 30 


28.2 ± 4.4 


22.1 ± 3.5 


Non-VF 


69 ± 35 


21 ± 11 


1.3 ± 0.9 


0.79 ± 0.74 


Total Background 


1033 ± 125 


381 ± 46 


48.8 ± 5.9 


32.7 ± 4.2 


Data 


975 


385 


87 


93 



the same tt Monte Carlo simulated sample. We com- 
pute, for each tt event, the probability of having n 
tagged jets in the event by assigning to each jet a 
probability to be tagged. The sum of these prob- 
abilities over all the events returns the number of 



expected events with at least n tags, from which we 
calculate the tagging efficiency. For light flavor jets, 
this probability is computed using the mistag ma- 
trix, while for heavy flavor jets the probability is the 
value of the tagging scale factor (see Section IV) if 
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TABLE XXI: Summary of acceptances for ti events. Efficiencies are expressed as percentages. The average eb-tag 
is the luminosity- weighted CEM/CMUP/CMX tagging efficiency. First uncertainty is statistical and the second one 
corresponds to systematics. 



Quantity 


CEM 


CMUP 


CMX 


^pretag 


3.67 ± 0.02 ± 0.22 


1.92 ± 0.01 ± 0.12 


0.751 ± 0.008 ± 0.046 


J Ldt (pb" 1 ) 


318 ± 19 


318 ± 19 


305 ± 18 


Single Tag, P, < 1%, SF = 0.817 ± 0.070 


^b— tag 


54.7 ± 0.2 ± 3.6 


54.1 ± 0.3 ± 3.5 


55.2 ± 0.5 ± 3.6 


Average e b -ta g 




54.5 ± 0.2 ± 3.6 






2.00 ± 0.01 ± 0.18 


1.04 ± 0.01 ± 0.09 


0.41 ± 0.01 ± 0.04 


eujLdtipb- 1 ) 


6.38 ± 0.04 ± 0.68 


3.30 ± 0.03 ± 0.36 


1.32 ± 0.02 ± 0.14 


Total e t tj Ldt 




11.00 ± 0.05(stat) ± 1.17(syst) pb~ 


-i 




Single Tag, Pj 


< 5%, SF = 0.852 ± 0.072 




£b—tag 


68.8 ± 0.2 ± 3.7 


68.6 ± 0.3 ± 3.7 


69.6 ± 0.5 ± 3.7 


Average e b -t ag 




68.8 ± 0.2 ± 3.7 






2.52 ± 0.01 ± 0.20 


1.315 ± 0.009 ± 0.108 


0.523 ± 0.006 ± 0.042 


e ti jLdt (pb- 1 ) 


8.03 ± 0.05 ± 0.80 


4.19 ± 0.03 ± 0.42 


1.67 ± 0.02 ± 0.17 


Total e t tj Ldt 




13.89 ± 0.06(stat) ± 1.38(syst) pb~ 


-l 



the jet is tagged and zero otherwise. We estimate 
the systematic uncertainty on the event tagging effi- 
ciency by varying the tagging scale factor and mistag 
prediction by ±lo\ 

Table XXI summarizes the acceptance for ti events. 
For Pj cuts of 1% and 5%, the combined accep- 
tance times integrated luminosity are, respectively, 
11.00 ± 0.05(stat) ± 1.17(syst) pb" 1 and 13.89 ± 
0.06(stat) ± 1.38(syst) pb" 1 , where the statistical un- 
certainty is uncorrelated between the lepton types, 
and the systematic uncertainty is assumed to be 100% 
correlated since it is dominated by the luminosity and 
tagging scale factor uncertainties. 

Table XXII summarizes the contributions to the 
systematic uncertainty on the signal acceptance. 
Trigger, lepton identification and z vertex cuts have 
already been discussed in Sections III and V. The 
observed difference in the conversion veto efficiency 
between ti events and the Z — > e + e~ sample used 
to measure the electron identification scale factor is 
added as an uncertainty on the tight electron iden- 
tification efficiency. The efficiency of the cosmic ray 
veto is measured from data and accounts for a 1% un- 
certainty on the tight muon identification efficiency. 
Additional uncertainties in the electron (muon) ac- 
ceptance are due to Et (pt) scale, Et (pt) resolu- 
tion and material (geometrical) effects, and are found 
to be 0.3% (1.2%) in inclusive W events. The lep- 
ton isolation uncertainty accounts for differences in 
the modeling of the lepton identification in events 
with different jet multiplicity. It has been evaluated 



by comparing data to Monte Carlo simulations for 
VK+jets and ti events. The uncertainty due to the 
jet energy scale is estimated by the shift in signal ac- 
ceptance observed by changing the jet energy correc- 
tions within their uncertainties. The uncertainty due 
to parton distribution functions (PDF) is estimated 
by re-weighting the ti events generated with CTEQ5L 
for different sets of PDFs [28]. In particular, we con- 
sider the difference in signal acceptance between NL0 
CTEQ6M and CTEQ5L, between MRST for two different 
values of a Sl and between NL0 CTEQ6M and the 20 
CTEQ eigenvectors, and we add in quadrature all the 
contributions. Differences in the modeling of ti pro- 
duction and decay are evaluated as the difference in 
acceptance between samples of signal events gener- 
ated with HERWIG and PYTHIA. Samples of ti events 
with different levels of initial and final state radia- 
tion (ISR/FSR) are used to evaluate the effect of this 
source of uncertainty on the signal acceptance. The 
systematic uncertainty on the event tagging efficiency 
is estimated by varying the tagging scale factor and 
the mistag prediction by ± la. The total systematic 
uncertainty on the signal acceptance is 8.9% (8.0%) 
for Pj < 1% (5%), and is dominated by the tagging 
scale factor and the jet energy scale uncertainties. 



30 



TABLE XXII: Summary of the systematic uncertainties on the signal acceptance. The second column quotes the 
relative uncertainty on the indicated quantities, while the third column shows the effect on the overall tt acceptance. 



Source 


Relative Uncertainty (%) 


Uncertainty on the Acceptance (%) 


Central Electron 1 rigger 


0.6 


0.3 


Central Electron ID SF 


0.5 


0.3 


Conversion Veto Eff. 


1.4 


0.8 


Et Scale of Electron 


0.3 


0.2 


Central Muon 1 rigger 


0.5 


0.2 


Central Muon ID SF 


1.0 


0.3 


CMX Muon Trigger 


0.4 


0.05 


CMX Muon ID SF 


0.6 


0.07 


Cosmic Veto Eft. 


1.0 


0.4 


Pt Scale of Muon 


1 o 
l.z 


U.o 


Lcpton Isolation 


2.0 


2.0 


\Z vtx \ Cut Eff. 


0.3 


0.3 


Z^ Prob Cut Eff. 


2 


2 


Jet Energy Scale 




4.2 


PDF 




2 


MC Modeling 




1.6 


ISR/FSR 




1.3 


Tagging SF Pj < 1% (b's/c's) 


8.6/12.9 


6.5 


Mistag Asymmetry Pj < 1% 


11.0 


0.2 


Tagging SF Pj < 5% (b's/c's) 


8.5/12.7 


5.4 


Mistag Asymmetry Pj < 5% 


15.5 


0.4 


Total Uncertainty {Pj < 1%) 




8.9 


Total Uncertainty (Pj < 5%) 




8.0 



VIII. CROSS SECTION FOR SINGLE 
TAGGED EVENTS 



We measure the cross section as 
N b s — N bck 



e t{ x / L dt ' 



(19) 



where N b. s is the observed number of events with 
at least one jet tagged, N bck is the background es- 
timate in the signal region, e t j is the signal accep- 
tance including the tagging efficiency and J L dt is 
the integrated luminosity. The estimated number of 
background events must be corrected for the tt con- 
tribution, since we normalize mistag and TU+heavy 
flavor backgrounds assuming no tt signal events in 
the pretag sample. We apply an iterative procedure 
in which we first estimate the number of tagged top 
candidates in the sample as the number of tagged 
signal events minus the total background in the > 
3 jet bins. Successively, the obtained signal cross 
section is used to estimate the number of tt events 
before the 6-tagging requirement, and this contribu- 
tion is subtracted from the total number of events to 
which we normalize the mistag, Wbb, Wcc and Wc 



backgrounds. The expectations for single top, dibo- 
son and Z^t + t~ do not change with the number of 
tt events in the signal region. The change for non- 
W background is found to be negligible compared to 
its uncertainty. Therefore, this background is also 
kept fixed. Having obtained a new estimate for the 
tagged background, we re-evaluate the number of tt 
candidates. The procedure is repeated until the cross 
section a t i changes by less than 0.1%. 

Starting with the backgrounds shown in Ta- 
bles XIX and XX, we apply the iterative procedure 
and measure 

a ti = 8.9tl;°(stat.)+^(syst.) pb 

for Pj < 1%. As a cross check, we apply the iterative 
procedure for Pj < 5% and measure 



o tl = 9.6t^(stat.)+^(syst.) pb. 

The final signal and background estimates are shown 
in Table XXIII, together with the observed number 
of events. 

Figure 19 compares the numbers of observed data 
to background and signal expectations, for Pj < 1% 
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TABLE XXIII: Summary of the final signal and background estimates and observed data in the single tag sample. 



Jet Multiplicity 


1 jet 


2 jets 


3 jets 


> 4 jets 


Prctag Data 


29339 


4442 


300 


166 


P.j < 1% 


Electroweak 


9.3 ± 1.1 


16.6 ± 1.8 


2.3 ± 0.3 


0.71 ± 0.09 


Wbb 


83 ± 23 


47 ± 13 


4.3 ± 1.2 


1.1 ± 0.3 


Wcc 


31 ± 9 


17.3 ± 5.2 


1.6 ± 0.5 


0.4 ± 0.1 


Wc 


86 ± 21 


19.0 ± 4.9 


1.0 ± 0.3 


0.21 ± 0.06 


Mistag 


149 ± 17 


51 ± 6 


6.1 ± 0.7 


2.2 ± 0.3 


Non-W 


31 ± 16 


8.6 ± 4.6 


0.9 ± 0.6 


0.5 ± 0.5 


Total Background 


389 ± 49 


159 ± 22 


16.3 ± 2.0 


5.1 ± 0.7 


tt (8.9 pb) 


2.5 ± 0.5 


20.6 ± 2.4 


40.4 ± 4.5 


58.1 ± 6.2 


Data 


350 


191 


52 


68 


Pj < 5% 


Electroweak 


16.3 ± 1.8 


28.8 ± 3.0 


4.0 ± 0.4 


1.4 ± 0.1 


Wbb 


111 ± 31 


60 ± 17 


5.2 ± 1.4 


1.1 ± 0.3 


Wcc 


68 ± 20 


36 ± 11 


3.2 ± 1.0 


0.76 ± 0.24 


Wc 


184 ± 45 


40 ± 10 


2.2 ± 0.6 


0.5 ± 0.13 


Mistag 


585 ± 92 


191 ± 30 


19.6 ± 3.1 


6.1 ± 1.0 


Non-W 


69 ± 35 


21 ± 11 


1.3 ± 0.9 


0.8 ± 0.7 


Total Background 


1033 ± 125 


377 ± 46 


35.5 ± 4.2 


10.6 ± 1.4 


tt (9.6 pb) 


3.6 ± 0.6 


28.4 ± 3.1 


55.1 ± 5.7 


78.6 ± 7.8 


Data 


975 


385 


87 


93 



and 5%, for the measured tt production cross sec- 
tions. 

The statistical uncertainty on the measured cross 
section is dominated by the data sample size. Ta- 
ble XXIV summarizes the systematic contributions 
to the cross section uncertainty. The correlations 
in acceptances, tagging scale factor and Juminosity 
uncertainty are taken into account. Wbb and Wcc 
systematics are considered correlated across all the 
bins. All the other uncertainties are treated as un- 
corrected. 



sured cross sections as a function of the top mass 
returns a slope of -0.052 ± 0.008 pb/(GeV/c 2 ) and 
-0.066 ± 0.008 pb/(GeV/c 2 ) for Pj <1% and 5% re- 
spectively, where the uncertainties are due to Monte 
Carlo simulation statistics. Note that the fit results 
for m t = 178 GcV/c 2 agree with the measured cross 
section within the 1.6% uncertainty estimated in Sec- 
tion VII due to different modeling in PYTHIA and 
HERWIG. 



A. tt Cross Section Dependence on the Top 
Quark Mass 

The signal acceptance used in this analysis has 
been computed using a sample of tt events genera- 
ted with PYTHIA for m t = 178 GeV/c 2 , which co- 
rresponds to the combined Run I top mass measure- 
ment at the Tevatron Collider [3]. We study the de- 
pendence of the ti cross section on the top quark 
mass by reevaluating the signal acceptance through 
a set of Monte Carlo simulated samples generated 
by HERWIG for different values of the top mass. Re- 
sults are shown in Fig. 20. A linear fit to the mea- 



B. Electron versus Muon tt Cross Section 
Measurements 



As an additional cross check, we measure the cross 
section separately for events with tight electrons and 
muons. Table XXV summarizes the cross sections 
for the two analyses with Pj < 1% and Pj < 5%. 
The cross section measurements in the electron and 
muon+jets samples agree within their statistical un- 
certainty. 
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TABLE XXIV: Summary of the systematic uncertainties in the single tag analysis. 



Source 


Fractional Syst. Uncert. (%) 


Contribution to er t j (%) 






Pj < 1% 


Pj < 5% 


Central Electron ID 


1.0 


+0.99/-0.97 


+1.00/-0.98 


Central Muon ID 


1.9 


+0.61/-0.61 


+0.62/-0.61 


CMX Muon ID 


1.8 


+0.22/-0.22 


+0.22/-0.22 


PDF 


2 


+2.1/-2.0 


+2.1/-2.0 


Jet Energy Scale 


4.2 


+4.5/-4.2 


+4.6/-4.2 


Lepton Isolation 


o 
z 


I O 1 / o n 
H-z.l/-z.U 


_i_o 1 / o n 
H-Z.1/-Z.U 


ISR/FSR 


1.3 


+1.4/-1.3 


+ 1.4/-1.3 


MC Modeling 


1.6 


+1.7/-1.6 


+ 1.7/-1.6 


Z Vertex 


2.0 


+2.1/-2.1 


+2.2/-2.1 


Tagging SF Pj < 1% (b's/c's) 


8.6/12.9 


+8.2/-7.2 




Tagging SF Pj < 5% (b's/c's) 


8.5/12.7 




+7.0/-6.3 


Mistag Asymmetry Pj < 1% 


11.0 


+0.93/-0.93 




Mistag Asymmetry Pj < 5% 


15.5 




+3.0/-3.0 


Non-IU Fraction 


50 


0.33 


0.56 


Non-lU Prediction 


50 


0.71 


0.79 


W+BF Prediction 


30 


2.6 


2.9 


Cross Sections Bck. 


1.8 


0.056 


0.072 


Luminosity 


5.9 


+6.5-5.7 


+6.5-5.8 


Total Systematic Uncertainty 




+ 12.5/-11.3 


+12.3/-11.3 



TABLE XXV: Summary of the cross sections for Pj < 1% and Pj < 5% and for each lepton type. Results are expressed 
in pb. 







Total 


Electrons 


Muons 


Pj 

Pj 


< 1% 

< 5% 


8.9+{^(stat.)+^(syst.) 
9.6j;o : 9(stat.)A^(syst.) 


8.6+{^(stat.)±^(syst.) 
9.4t^(stat.)tl:?(syst.) 


9.4+{-Kstat.n^(syst.) 
9.9ti;t(stat.)tl:?(syst.) 



IX. CROSS SECTION FOR DOUBLE 
TAGGED EVENTS 



The measurement of the ttbar cross section in a 
sample with at least two b tags follows the same pro- 
cedure as the single tag analysis, with a much purer 
sample of tt events. As shown in Table X, after re- 
quiring the event selection described in Section V, we 
observe 30 (61) events with two 6-tagged jets out of 
the 120 (180) events with at least one 6-tagged jet 
for Pj < 1% (Pj < 5%). Table XXVI shows the 
signal acceptances and the efficiencies to tag two jets 
in signal events passing the pretag selection. The 
total acceptance times luminosity for Pj < 1% and 
Pj < 5% is 2.57 ± 0.02(stat) ± 0.49(syst) pb^ 1 and 
4.92 ± 0.04(stat) ± 0.87(syst) pb" 1 respectively. 



A. Backgrounds in the Double b-Tag Sample 

A few differences with respect to the single tag 
analysis must be taken into account in order to esti- 
mate the backgrounds. We define the mistag back- 
ground as the events with at least two mistagged jets. 
The negative tag rate matrix is applied to negatively 
taggable jets in the event and the probability to have 
at least two mistagged jets is summed over all events. 
The mistag prediction is scaled by the fraction of 
non-lU, electroweak backgrounds and by the mistag 
asymmetry as is done for the single tag analysis. 

Events with one real heavy flavor tag plus a mistag 
are included in the other background sources. The 
contribution of mistags to the TU+heavy flavor back- 
ground is taken into account by applying the mistag 
rate matrix to light flavor jets in events with an ex- 
tra real tag when computing the tagging efficiency. 
Results are summarized in Table XXVII. 

The strategy to estimate the non-W background 
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TABLE XXVI: Summary of acceptances for ti events. Efficiencies are expressed as percentages. The first uncertainty 
quoted is statistical and the second is systematic. The average et-tag is the luminosity-weighted CEM/CMUP/CMX 
tagging efficiency. 



On antitv 


CEM 


CMUP 


CMX 


g protag 

fhdt (pb- 1 ) 


3.67 ± 0.02 ± 0.22 


1.92 ± 0.01 ± 0.12 


0.751 ± 0.008 ± 0.046 


318 ± 19 


318 ± 19 


305 ± 18 




Double Tag, Pj 


< 1%, SF = 0.817 ± 0.070 




^b—tag 


12.7 ± 0.2 ± 2.1 


12.6 ± 0.2 ± 2.0 


13.4 ± 0.4 ± 2.2 


Average e b - tag 




12.7 ± 0.1 ± 2.1 






0.465 ± 0.006 ± 0.081 


0.241 ± 0.004 ± 0.042 


0.101 ± 0.003 ± 0.018 


eufLdt (pb" 1 ) 


1.48 ± 0.02 ± 0.27 


0.77 ± 0.01 ± 0.14 


0.32 ± 0.01 ± 0.06 


Total e« J Ldt 




2.57 ± 0.02(stat) ± 0.49(syst) pb" 1 






Double Tag, Pj 


< 5%, SF = 0.852 ± 0.072 




£b-tag 


24.4 ± 0.2 ± 3.6 


24.1 ± 0.3 ± 3.6 


25.2 ± 0.5 ± 3.7 


Average e b ^ tag 




24.4 ± 0.2 ± 3.6 






0.895 ± 0.009 ± 0.142 


0.462 ± 0.006 ± 0.074 


0.189 ± 0.004 ± 0.030 


e ti jLdt (pb" 1 ) 


2.85 ± 0.03 ± 0.48 


1.47 ± 0.02 ± 0.25 


0.60 ± 0.01 ± 0.10 


Total e« /' Ldt 




4.92 ± 0.04(stat) ± 0.87(syst) pb" 1 





TABLE XXVII: Jet probability tagging efficiencies for Wbb, Wcc and Wc events for double tagged events. Values are 
expressed as percentages. The first uncertainty quoted is statistical and the second is systematic. IB and 2B (1C and 
2C) refer to Wbb (Wcc) events with one and two reconstructed heavy flavor jets respectively. 

jet multiplicity 2 jets 3 jets > 4 jets 

Double Tag Tagging Efficiencies, Pj < 1% 

IB (>2 tags) 0.27 ± 0.06 ± 0.05 0.90 ± 0.30 ± 0.15 1.3 ± 0.8 ± 0.2 

2B (>2 tags) 10.3 ± 0.4 ± 1.8 13. 1± 1.1 ± 2.2 14.1 ± 1.5 ± 2.4 

1C (>2 tags) 0.067 ± 0.037 ± 0.017 0.23 ± 0.17 ± 0.06 0.29 ± 0.38 ± 0.08 

2C (>2 tags) 0.43 ± 0.12 ± 0.11 1.3 ± 0.5 ± 0.3 1.1 ± 0.5 ± 0.3 

Wc (>2 tags) 0.05 ± 0.03± 0.01 0.20 ± 0.14 ± 0.05 0.22 ± 0.21 ± 0.06 

Double Tag Tagging Efficiencies, Pj < 5% 

IB (>2 tags) 1.3 ± 0.1 ± 0.2 3.7 ± 0.6 ± 0.6 5.0 ± 1.5 ± 0.8 

2B (>2 tags) 18.6 ± 0.6 ± 3.1 23.9 ± 1.4 ± 4.0 26.0 ± 1.9 ± 4.4 

1C (>2 tags) 0.54 ± 0.11 ± 0.14 1.6 ± 0.4 ± 0.4 1.8 ± 0.9 ± 0.5 

2C (>2 tags) 2.5 ± 0.3 ± 0.6 5.6 ± 0.9 ± 1.4 6.3 ± 1.2 ± 1.6 

Wc (>2 tags) 0.40 ± 0.08 ± 0.10 1.5 ± 0.4 ± 0.4 2.1 ± 0.6 ± 0.5 



is changed, compared to that used for the sin- 
gle tag sample, due to low statistics in the double 
tagged event sample in the sideband regions (see Sec- 
tion VI B). We compute a common tag rate for all 
the jet multiplicity bins by using data in region B 
(isolation < 0.1 and $ T < 15 GeV). We divide the 
total number of double tagged events by the sum of 
the number of pretag events scaled by the jet pair 
multiplicity. Finally, we apply this tag rate to the 
pretag expectation in the signal region derived in Sec- 
tion VI Bl. 

Background predictions for Pj < 1% and Pj < 



5% are compared to the data in Tables XXVIII 
and XXIX, respectively. 

The iterative procedure described in Section VIII 
is applied, and we obtain a cross section of 

a ti = ll.lt?-j(stat.)l"(syst.) pb 
for Pj < 1% and 

( 7 tf =11.6ll;J(stat.)l^(syst.) P b 
for Pj < 5%. Signal and background estimates after 
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TABLE XXVIII: Summary of the background estimate in the double tag sample for Pj < 1%. 



Jet Multiplicity 


2 jets 


3 jets 


> 4 jets 


Elcctrowcak 


WW 


0.05 ± 0.02 


0.03 ± 0.02 


0.006 ± 0.006 


WZ 


0.25 ± 0.05 


0.03 ± 0.01 


0.013 ± 0.006 


ZZ 


0.014 ± 0.005 


0.001 ± 0.001 


0.001 ± 0.001 


Single Top W-g 


0.17 ± 0.03 


0.12 ± 0.03 


0.05 ± 0.01 


Single Top W* 


0.88 ± 0.17 


0.14 ± 0.03 


0.035 ± 0.007 


Z -> t+t- 


0.06 ± 0.06 


± 


± 


Total 


1.4 ± 0.3 


0.33 ± 0.06 


0.10 ± 0.02 


W + Heavy Flavour 


Wbb 


6.2 ± 2.0 


0.89 ± 0.29 


0.61 ± 0.21 


Wcc 


0.38 ± 0.17 


0.13 ± 0.06 


0.077 ± 0.046 


Wc 


0.13 ± 0.08 


0.03 ± 0.03 


0.02 ± 0.02 


Total 


6.7 ± 2.1 


1.1 ± 0.3 


0.71 ± 0.24 


Others 


Mistag 


0.21 ± 0.05 


0.10 ± 0.02 


0.12 ± 0.03 


Non-W 


0.19 ± 0.12 


0.03 ± 0.02 


0.05 ± 0.03 


Total Background 


8.5 ± 2.3 


1.5 ± 0.4 


0.97 ± 0.25 


Data 


13 


12 


18 



TABLE XXIX: Summary of the background estimate in the double tag sample for Pj < 5%. 



Jet Multiplicity 


2 jets 


3 jets 


> 4 jets 


Elcctrowcak 


WW 


0.29 ± 0.06 


0.13 ± 0.04 


0.07 ± 0.03 


WZ 


0.51 ± 0.10 


0.06 ± 0.02 


0.03 ± 0.01 


ZZ 


0.026 ± 0.007 


0.004 ± 0.002 


0.002 ± 0.001 


Single Top W-g 


0.39 ± 0.07 


0.23 ± 0.04 


0.09 ± 0.02 


Single Top W* 


1.5 ± 0.3 


0.26 ± 0.05 


0.06 ± 0.01 


Z -» t+t- 


0.07 ± 0.07 


± 


± 


Total 


2.83 ± 0.51 


0.70 ± 0.12 


0.25 ± 0.05 


W + Heavy Flavour 


Wbb 


11.5 ± 3.7 


1.8 ± 0.6 


1.2 ± 0.4 


Wcc 


2.4 ± 0.9 


0.61 ± 0.24 


0.45 ± 0.19 


Wc 


0.98 ± 0.38 


0.25 ± 0.11 


0.19 ± 0.09 


Total 


14.9 ± 4.7 


2.6 ± 0.8 


1.9 ± 0.6 


Others 


Mistag 


2.7 ± 0.9 


1.0 ± 0.3 


1.3 ± 0.4 


Non-W 


0.63 ± 0.34 


0.09 ± 0.05 


0.14 ± 0.09 


Total Background 


21.1 ± 5.1 


4.4 ± 0.9 


3.5 ± 0.7 


Data 


28 


22 


39 



the iterative procedure are shown in Table XXX, to- 
gether with the observed number of events. Figure 21 
compares the numbers of observed data to back- 
ground and signal expectations for Pj < 1% and 5% 
for the measured ti production cross sections. 



The statistical uncertainty on the measured cross 
section is dominated by the data sample size. Ta- 
ble XXXI summarizes the systematic contributions 
to the cross section uncertainty. 
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TABLE XXX: Summary of the final signal and background estimates and observed data in the double tag sample. MC 
derived refers to electroweak processes. 



Jet Multiplicity 


2 jets 




3 jets 


> 4 jets 


Prctag Data 


4442 




300 


166 




Pj 


< 1% 






MC Derived 


1.4 ± 0.3 




0.33 ± 0.06 


0.10 ± 0.02 


Wbb 


6.1 ± 1.9 




0.57 ± 0.19 


0.10 ± 0.03 


Wcc 


0.38 ± 0.17 




0.09 ± 0.04 


0.013 ± 0.008 


Wc 


0.12 ± 0.08 




0.02 ± 0.02 


0.003 ± 0.003 


Mistag 


0.21 ± 0.05 




0.06 ± 0.01 


0.019 ± 0.004 


Non-JV 


0.19 ± 0.12 




0.03 ± 0.02 


0.05 ± 0.03 


Total Background 


8.4 ± 2.2 




1.1 ± 0.3 


0.28 ± 0.06 


tt (11.1 pb) 


3.9 ± 0.9 




10.2 ± 2.0 


18.4 ± 3.4 


Data 


13 




12 


18 




Pj 


< 5% 






MC Derived 


2.83 ± 0.51 




0.70 ± 0.12 


0.25 ± 0.05 


Wbb 


11.4 ± 3.6 




1.1 ± 0.3 


0.16 ± 0.05 


Wcc 


2.3 ± 0.9 




0.38 ± 0.15 


0.06 ± 0.03 


Wc 


0.97 ± 0.37 




0.16 ± 0.07 


0.03 ± 0.01 


Mistag 


2.7 ± 0.8 




0.65 ± 0.20 


0.15 ± 0.05 


Non-IV 


0.63 ± 0.34 




0.09 ± 0.05 


0.14 ± 0.09 


Total Background 


20.9 ± 5.0 




3.1 ± 0.6 


0.80 ± 0.15 


tt (11.6 pb) 


7.5 ± 1.5 




20.5 ± 3.7 


36.6 ± 6.1 


Data 


28 




22 
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TABLE XXXI: Sum; 


mary of the systematical uncertainties 


in the double tag analysis. 




Source 


Fractional Syst. Unccrt. (%) 


Contribution to 


°ti (%) 






Pj < 1% 


Pj < 5% 


Central Electron ID 


1.6 


+0.98/-0.96 


+0.98/-0.96 


Central Muon ID 


1.9 


+0.60/-0.60 


+0.61/-0.60 


CMX Muon ID 


1.8 


+0.21/-0.21 


+0.21/-0.21 


PDF 


2 


+2.1/-2.0 


+2.1/-2.0 


Jet Energy Scale 


4.2 


+4.5/-4.1 


+4.5/-4.1 


Lepton Isolation 


2 


+2.1/-2.0 


+2.1/-2.0 


ISR/FSR 


1.3 


+1.3/-1.3 


+1.3/-1.3 


MC Modeling 


1.6 


+1.7/-1.6 


+1.7/-1.6 


Z Vertex 


2.0 


+2.1/-2.0 


+2.1/-2.0 


Tagging SF Pj < 1% (b's/c's) 


8.6/12.9 


+20.3/-14.7 




Tagging SF Pj < 5% (b's/c's) 


8.5/12.7 




+18.3/-13.6 


Mistag Asymmetry Pj < 1% 


11.0 


+0.063/-0.063 




Mistag Asymmetry Pj < 5% 


15.5 




+0.44/-0.44 


Non-W Fraction 


50 


0.060 


0.092 


Non-W Prediction 


50 


0.13 


0.21 


W+HF Prediction 


30 


0.84 


1.0 


Cross Sections Bkg. 


1.8 


0.027 


0.030 


Luminosity 


5.9 


+6.4/-5.7 


+6.4/-5.7 


Total Systematic Uncertainty 




+22.2/-16.8 


+20.4/-15.9 



B. Cross Section Dependence on the Top in an analogous way to Section VIII A. Results are 

Quark Mass 

We study the dependence of the tt cross section 
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FIG. 17: Events with negative tagged jets compared to 
the prediction using the mistag matrix. We select events 
in the high-pr lepton sample with I} T < 20 GeV. Pj 
cuts of 1% and 5% are used on the top and bottom plots 
respectively. 



FIG. 18: Events with negative tagged jets (squares) in 
the pretag lepton+jets sample compared to the prediction 
using the mistag matrix before (empty circles) and after 
(empty triangles) heavy flavor corrections for Pj < 1% 
(5%) in the top (bottom) plot. 



shown in Fig. 22. A linear fit to the measured cross 
sections as a function of the top mass returns a 
slope of -0.096 ± 0.022 pb/(GeV/c 2 ) and -0.082 ± 
0.019 pb/(GeV/c 2 ) for Pj <1% and Pj <5%, respec- 
tively, where the uncertainties are due to Monte Carlo 
simulation statistics. As before, note that the fit re- 
sults for m t = 178 GeV/c 2 agree with the measured 
cross section within the 1.6% uncertainty estimated 
in Section VII due to different modeling in PYTHIA 
and HERWIG. 



C. Comparison Between Single and Double Tag 
Cross Sections 

Although the measurements of the single and dou- 
ble tag cross sections are statistically compatible, we 
observe a ratio of about 1.2 between the measured 
cross sections in the double and single tag samples. 
We use pseudo-experiments to estimate the probabil- 



ity to obtain a cross section greater than the mea- 
sured double tag cross section when we assume that 
the measured single tag cross section is correct. For 
each pseudo-experiment, we vary the total double tag 
background estimate according to a Gaussian distri- 
bution with a width equal to its uncertainty. Succes- 
sively, we add the background to the expected signal 
by assuming the single tag tb cross section and we 
vary the total number of events according to a Pois- 
son distribution. We repeat this procedure 10,000 
times and count the number of pseudo-experiments 
in which we have a result greater than the one ob- 
served in data. We find a probability of 13.2% for 
Pj < 1% and 15.6% for Pj < 5%. 

The systematic uncertainty in the double tag mea- 
surement is dominated by the uncertainties on the 
acceptance, luminosity and tagging scale factor. The 
systematic uncertainties on the background predic- 
tion are negligible. A bias on the values of accep- 
tance and luminosity would affect the cross section 
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FIG. 19: Single tag data and background contributions 
(for an integrated luminosity of 318 pb _1 ) as a function of 
the event jet multiplicity for Pj < 1% (top) and Pj < 5% 
(bottom). A top mass of mt = 178 GeV/c 2 is assumed. 
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FIG. 20: Top pair production cross sections as a func- 
tion of the top quark mass for Pj < 1% (top) and Pj < 
5% (bottom). The uncertainties shown are the statistical 
uncertainties on the acceptances for each mass. 



X. CONCLUSIONS 



measurement in the single and double tag samples 
in the same way. However, a bias on the tagging 
scale factor would have a greater effect in the dou- 
ble tag analysis than in the single tag one. To study 
this, we vary the tagging scale factor by ±lcr and 
we repeat the cross section measurements and the 
pseudo-experiments. Results are summarized in Ta- 
bles XXXII and XXXIII. As expected, the cross sec- 
tions measured in the double tag sample are more 
sensitive to a change in the scale factor, resulting in 
a better agreement between the single and double tag 
cross sections when a larger value for the scale factor 
is used. 



We present a measurement of the tt production 
cross section in pp collisions at y/s = 1.96 TeV with 
an integrated luminosity of 318 ± 18 pb _1 at the 
CDF detector. We select events compatible with the 
tt — > Ivqqbb decay mode by requiring one isolated 
electron (muon) with transverse energy Et (pt) > 
20 GeV and missing transverse energy I} T > 20 GeV 
and at least three jets with transverse energy Et > 
15 GeV. We further require at least one jet tagged by 
the jet probability algorithm. This selection accepts 
an estimated (3.5 ± 0.3)% of all tt events when a 
Pj < 1% cut is applied, and an estimated (4.4 ± 
0.4)% with a looser Pj cut at 5%. Backgrounds are 
estimated using data and Monte Carlo simulations. 
We find good agreement with the observed data in a 
control region defined by events with W+one or two 
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TABLE XXXII: Cross section for tt event production in single and double tag analysis for Pj < 1% and < 5% and 
different values of the tagging scale factors {SF). Results are expressed in pb. 





SF- 1 a 


SF 


SF + 1 a 


Pj < 1%, <1 tag 
Pj < 1%, <2 tags 
Pj < 5%, <1 tag 
Pj < 5%, <2 tags 


9.8ii;o(stat.)+^(syst.) 

13.3±^(stat.)±2:4( s y st -) 
10.5li;J(stat.)+i!(syst.) 

13.7ti;?(stat.)^2:4( s y st -) 


8.9i^(stat.)+^(syst.) 
ll.lt^(stat.)l"(syst .) 
9.6lJ;°(stat.)lo(syst.) 
ll.6li;J(stat.)i?;8(syst.) 


8.3+^(stat.)+^(syst.) 
9.4±?;?(Btat.)l?;2(Byst.) 
9.0iJ°(stat.)+lo(syst.) 
9.9^; 5 3 (stat.)+^( syst .) 
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FIG. 21: Double tag data and background contributions 
(for an integrated luminosity of 318 pb -1 ) as a function of 
the event jet multiplicity for Pj < 1% (top) and Pj < 5% 
(bottom). A top mass of mt = 178 GeV/c 2 is assumed. 



Slope = -0.096 ± 0.022 pb/(GeV/c ) 
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FIG. 22: Top pair production cross sections as a function 
of the top quark mass for Pj <1% (top) and Pj <5% 
(bottom) in the double tag analysis. The uncertainties 
shown are the statistical uncertainties on the acceptances 
for each mass. 



jets. Using the excess of events with three or more 
jets and at least one b tag with Pj < 1%, we measure 
a top pair production cross section of 



a tt - = 8.9±i;°(stat.)+^(syst.) pb. 
As cross checks, we measure the cross section using 



samples with different 6-tagging requirements. Using 
events with at least one b tag with Pj < 5% we obtain 

a tt - = 9.6l£;°(stat.)+^(syst.) pb. 

We also measure the ti production cross section in 
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i i i | i i i | i i i | i i i | i i i | i i i | i i i | i i i | i 

Lepton+Jete: Jet Prob b-Tag 



(L= 318 pb ) 



8.9±];S±l;5 



Lepton+Jets: SecVtx b-Tag 



(L=318pb ) 



8 7 + 0.9 + 1.1 
■'±0.9*0.9 



Lepton+Jets: Kinematic 



(L=347pb) 

Dilepton 

(L= 360 pb') 




6n + o.8 + i.o 
■ 0± 0.8 ±1.0 



8C + 2.6 + 0.7 
■ 5 ± 2.2 ± 0.3 



i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 



TABLE XXXIII: Probability to measure a cross section 
greater than the one obtained in the double tag analy- 
sis when the tt cross section measured in the single tag 
analysis is assumed. 





SF-la 


SF 


SF + 1 a 


Pj < 1% 


4.5% 


13.2% 


30% 


Pj < 5% 


2.8% 


15.6% 


35% 



events with at least two tagged jets. The acceptance 
for signal events is estimated to be (0.8 ± 0.1)% for 
P; < 1% and (1.5 ± 0.3)% for Pj < 5%. We measure 
a cross section of 

otf = ll.ll?j(stat.)+fj(syst.) pb 

for Pj < 1% and 

otf = 11.6l^(stat.)+?;^(syst.) pb 

for Pj < 5%. 

Figure 23 shows our main result together with 
other CDF tt cross section measurements and the- 
oretical predictions. Our result is above the central 
theoretical value by ~1.9 a. It should be noted that 
our result is highly correlated with the lepton+jets 
measurement using secondary vertex 6-tagging, de- 
scribed in [40], where a comparison between the jet 
probability and secondary vertex 6-taggers is given. 
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FIG. 23: Comparison of the tt production cross sec- 
tion measurement presented in this paper with theoret- 
ical predictions (solid band [1], dashed band [4]). Also 
shown are the CDF results obtained in the lepton+jets 
channel when using the secondary vertex tagger [40] and 
when no 6-tagging is applied (only kinematic informa- 
tion) [41], and the result obtained in the dilepton chan- 
nel [42]. All the measured cross sections are evaluated at 
m t = 178 GeV/c 2 . The grey (black) lines represent the 
statistical (total) uncertainties. For the dilepton analysis, 
the grey (black) lines represent the uncertainties coming 
from the fit (shape). 



APPENDIX: KINEMATIC DISTRIBUTIONS 

We compare the distributions for different kine- 
matic variables observed in data to the expectations 
for signal and backgrounds derived from a combina- 
tion of simulation and tt cross section measurements. 
Figures 24 to 27 show the results for the four sam- 
ples of events passing the selection criteria with at 
least three jets and one or two tags for Pj < 1% or 
Pj < 5%. The considered kinematic variables are the 
sum of the transverse energies of each object in the 
final state (Ht), the reconstructed transverse mass 



of the W boson, the missing transverse energy ($ T ) 
of the event, the Et of the lcpton, the transverse en- 
ergy of the tagged jets, and the pseudo-rapidity of 
the tagged jets with respect to the center of the de- 
tector. Kolmogorov-Smirnov (KS) probabilities are 
computed to test the agreement between observed 
and expected distributions. The distributions ob- 
served in the data are statistically consistent with 
the expected signal-plus-background distributions. 
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FIG. 24: Comparison of kinematic distributions in data to signal and background expectations for events passing 
the selection criteria with at least three jets and at least one tag for Pj < 1%. From the top-left corner: sum of 
the transverse energies of each object in the final state (Ht), reconstructed transverse mass of the W boson, missing 
transverse energy ($ T ), lepton Et, transverse energy of the tagged jets, and the pseudo-rapidity of the tagged jets 
with respect to the center of the detector. 
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FIG. 25: Comparison of kinematic distributions in data to signal and background expectations for events in the single 
tag sample (Pj < 5%). 
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FIG. 26: Comparison of kinematic distributions in data to signal and background expectations for events in the double 
tag sample (Pj < 1%). 
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FIG. 27: Comparison of kinematic distributions in data to signal and background expectations for events in the double 
tag sample (Pj < 5%). 
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